WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

A01H 5/00, 5/10, C12N 15/52, 15/82 



Al 



(11) International Publication Number: WO 97/30582 

(43) International Publication Date: 28 August 1997 (28.08.97) 



(21) International Application Number: PCT/US97/02187 

(22) International Filing Date: 6 February 1997 (06.02.97) 



(30) Priority Data: 

08/597,313 



6 February 1996 (06.02.96) 



US 



(60) Parent Application or Grant 

(63) Related by Continuation 
US 

Filed on 



08/597,313 (CON) 
6 February 1996 (06.02.96) 



(71) Applicants (for all designated Stales except US)z CARNEGIE 

INSTITUTION OF WASHINGTON [US/US]; 1530 P. 
Street, N.W., Washington, DC 20016 (US). MONSANTO 
COMPANY, INC. [US/US]; 700 Chesterfield Parkway 
North, St. Louis, MO 63198 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): BROUN, Pierre [FR7USJ; 

1249 Capuchino, Burlingame, CA 94010 (US). VAN 
DE LOO, Frank [AU/AU1; 11 Fowles Street, Weston, 
ACT 2611 (AU). BODDUPALLI, Sekhar, S. [IN/US]; 
572 Enchanted Parkway, Manchester, MO 63021 (US). 



SOMERVILLE, Chris [US/US]; 5 Valley Oak, Portola 
Valley, CA 94028 (US). 

(74) Agents: KOKULIS, Paul, N. et al.; Cushman Darby & 
Cushman, Intellectual Property Group of Pillsbury Madison 
& Sutro, 1100 New York Avenue, N.W., Washington, DC 
20005 (US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB. BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
HU, IL, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, 
LT, LU, LV, MD, MG, MK, MN, MW, MX, NO, NZ, PL, 
PT, RO, RU, SD, SE, SG, SI, SK, TJ, TM, TR, TT, UA, 
UG, US, UZ r VN, ARIPO patent (KE, LS, MW, SD, SZ, 
UG), Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, 
TM), European patent (AT, BE, CH, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, 
BJ, CF, CG, CI, CM, GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: PRODUCTION OF HYDROXYLATED FATTY ACIDS IN GENETICALLY MODIFIED PLANTS 



(57) Abstract 



This invention relates to plant fatty acid hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant 
fatty acid hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of 
hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level 
of lipid fatty acid unsaturation in transgenic plants is described. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


Armenia 


GB 


United Kingdom 


MW 


Malawi 


AT 


Austria 


GE 


Georgia 


MX 


Mexico 


AU 


Australia 


GN 


Guinea 


NE 


Niger 


BB 


Barbados 


GR 


Greece 


NL 


Netherlands 


BE 


Belgium 


HU 


Hungary 


NO 


Norway 


BF 


Burkina Faso 


IE 


Ireland 


NZ 


New Zealand 


BG 


Bulgaria 


IT 


Italy 


PL 


Poland 


BJ 


Benin 


JP 


Japan 


PT 


Portugal 


BR 


Brazil 


KE 


Kenya 


RO 


Romania 


BY 


Belarus 


KG 


Kyrgystan 


RU 


Russian Federation 


CA 


Canada 


KP 


Democratic People's Republic 


SD 


Sudan 


CF 


Central African Republic 




of Korea 


SE 


Sweden 


CG 


Congo 


KR 


Republic of Korea 


SG 


Singapore 


CH 


Switzerland 


KZ 


Kazakhstan 


SI 


Slovenia 


CI 


Cdte d*I voire 


LI 


Liechtenstein 


SK 


Slovakia 


CM 


Cameroon 


LK 


Sri Lanka 


SN 


Senegal 


CN 


China 


LR 


Liberia 


sz 


Swaziland 


CS 


Czechoslovakia 


LT 


Lithuania 


TD 


Chad 


CZ 


Czech Republic 


LU 


Luxembourg 


TG 


Togo 


DE 


Germany 


LV 


Latvia 


TJ 


Tajikistan 


DK 


Denmark 


MC 


Monaco 


TT 


Trinidad and Tobago 


EE 


Estonia 


MD 


Republic of Moldova 


UA 


Ukraine 


ES 


Spain 


MC 


Madagascar 


UG 


Uganda 


FI 


Finland 


ML 


Mali 


US 


United States of America 


FR 


France 


MN 


Mongolia 


uz 


Uzbekistan 


GA 


Gabon 


MR 


Mauritania 


VN 


Viet Nam 



WO 97/30582 



PCT/US97/02187 



1 



PRODUCTION OF HYDROXYLATED PATTY ACIDS 
IN GENETICALLY MODIFIED PLANTS 

TECHNICAL FIELD 

The present invention concerns the 
5 identification of nucleic acid sequences and 

constructs, and methods related thereto, and the use 
of these sequences and constructs to produce 
genetically modified plants for the purpose of 
altering the fatty acid composition of plant oils, 
10 waxes and related compounds. 

DEFINITIONS 

The subject of this invention is a class of 
enzymes that introduce a hydroxyl group into several 
different fatty acids resulting in the production of 

15 several different kinds of hydroxylated fatty acids. 
In particular, these enzymes catalyze hydroxylation 
of oleic acid to 12 -hydroxy oleic acid and icosenoic 
acid to 14 -hydroxy icosenoic acid. Other fatty acids 
such as palmitoleic and erucic acids may also be 

20 substrates. Since it is not possible to refer to the 
enzyme by reference to a unique substrate or 
product, the enzyme is referred throughout as kappa 
hydroxylase to indicate that the enzyme introduces 
the hydroxyl three carbons distal (i.e., away from 

2 5 the carboxyl carbon of the acyl chain) from a double 
bond located near the center of the acyl chain. 

The following fatty acids are also the 
subject of this invention: ricinoleic acid, 12- 
hydroxyoctadec~cis-9-enoic acid (120H-18 : l cisA9 ) ; 

30 lesquerolic acid, 14 -hydroxy- cis-11- icosenoic acid 
(14OH-20:l cisA11 ) ; densipolic acid, 12 -hydroxyoctadec- 
cis-9 , 15-dienoic acid ( 120H-18 : 2 ciBA9 ' 15 ) ; auricolic 
acid, 1.4-hydroxy-ris-ll , 17 -icosadienoic acid (140H- 
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20:2 cisA11 - 17 ) ; hydroxyerucic, 16-hydroxydocos-cis-13- 
enoic acid ( 160H-22 : l ci * A13 ) ; hydroxypalmitoleic, 12- 
hydroxyhexadec-cis-9-enoic (120H-16 : l ci "*») ; icosenoic 
acid (20:l ciaA11 ) . It will be noted that icosenoic acid 
5 is spelled eicosenoic acid in some countries. 

BACKGROUND 

Extensive surveys of the fatty acid 
composition of seed oils from different species of 
higher plants have resulted in the identification of 

10 at least 33 structurally distinct monohydroxylated 

plant fatty acids, and 12 different polyhydroxylated 
fatty acids that are accumulated by one or more 
plant species (reviewed by van de Loo et al., 1993) . 
Ricinoleic acid, the principal constituent of the 

15 seed oil from the castor plant Riclnus communis 
(L.), is of commercial importance. The present 
inventors have cloned a gene from this species that 
encodes a fatty acid hydroxylase, and have used this 
gene to produce ricinoleic acid in transgenic plants 

20 of other species. Some of this scientific evidence 
has been published by the present inventors (van de 
Loo et al. , 1995) . 

The use of the castor hydroxylase gene to 
also produce other hydroxylated fatty acids such as 

2 5 lesquerolic acid, densipolic acid, 

hydroxypalmitoleic , hydroxyerucic and auricolic acid 
in transgenic plants is the subject of this 
invention. In addition, the identification of a gene 
encoding a homologous hydroxylase from Lesquerella 

3 0 fendleri, and the use of this gene to produce these 

hydroxylated fatty acids in transgenic plants is the 
subject of this invention. 
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Castor is a minor oilseed crop. Approximately 
50% of the seed weight is oil ( triacylglycerol) in 
which 85-90% of total fatty acids are the 
hydroxylated fatty acid, ricinoleic acid. Oil 
5 pressed or extracted from castor seeds has many 

industrial uses based upon the properties endowed by 
the hydroxylated fatty acid. The most important uses 
are production of paints and varnishes, nylon- type 
synthetic polymers, resins, lubricants, and 

10 cosmetics (Atsmon, 1989) . 

In addition to oil, the castor seed contains 
the extremely toxic protein ricin, allergenic 
proteins, and the alkaloid ricinine. These 
constituents preclude the use of the untreated seed 

15 meal (following oil extraction) as a livestock feed, 
normally an important economic aspect of oilseed 
utilization. Furthermore, with the variable nature 
of castor plants and a lack of investment in 
breeding, castor has few favorable agronomic 

20 characteristics . 

For a combination of these reasons, castor is 
no longer grown in the United States and the 
development of an alternative domestic source of 
hydroxylated fatty acids would be attractive. The 

25 production of ricinoleic acid, the important 

constituent of castor oil, in an established oilseed 
crop through genetic engineering would be a 
particularly effective means of creating a domestic 
source . 

30 Because there is no practical source of 

lesquerolic, densipolic and auricolic acids from 
plants that are adapted to modern agricultural 
practices, there is currently no large-scale use of 
these fatty acids by industry. However, the fatty 
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acids would have uses similar to those of ricinoleic 
acid if they could be produced in large quantities 
at comparable cost to other plant -derived fatty 
acids (Smith, 1985) . 

Plant species, such as certain species in the 
genus Lesquerella, that accumulate a high proportion 
of these fatty acids, have not been domesticated and 
are not currently considered a practical source of 
fatty acids (Hirsinger, 1989) . This invention 
represents a useful step toward the eventual 
production of these and other hydroxylated fatty 
acids in transgenic plants of agricultural 
importance . 

The taxonomic relationships between plants 
15 having similar or identical kinds of unusual fatty 
acids have been examined (van de Loo et al . , 1993) 
In some cases, particular fatty acids occur mostly 
or solely in related taxa. In other cases there does 
not appear to be a direct link between taxonomic 
20 relationships and the occurrence of unusual fatty 

acids. In this respect, ricinoleic acid has now been 
identified in 12 genera from 10 families (reviewed 
in van de Loo et al . , 1993). Thus, it appears that 
the ability to synthesize hydroxylated fatty acids 
2 5 has evolved several times independently during the 
radiation of the angiosperms . This suggested to us 
that the enzymes which introduce hydroxyl groups 
into fatty acids arose by minor modifications of a 
related enzyme. 
30 Indeed, as shown herein, the sequence 

similarity between A12 fatty acid desaturases and 
the kappa hydroxylase from castor is so high that it 
is not possible to unambiguously determine whether a 
particular enzyme is a desaturase or a hydroxylase 
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on the basis of evidence in the scientific 
literature. Similarly, a patent application (PCT WO 
94/11516) that purports to teach the isolation and 
use of A12 fatty acid desaturases does not teach how 
to distinguish a hydroxylase from a desaturase. In 
view of the importance of being able to distinguish 
between these activities for the purpose of genetic 
engineering of plant oils, the utility of that 
application is limited to the several instances 
where direct experimental evidence (e.g., altered 
fatty acid composition in transgenic plants) was 
presented to support the assignment of function. A 
method for distinguishing between fatty acid 
desaturases and fatty acid hydroxylases on the basis 
of amino acid sequence of the enzyme is also a 
subject of this invention. 

A feature of hydroxylated or other unusual 
fatty acids is that they are generally confined to 
seed triacylglycerols, being largely excluded from 
the polar lipids by unknown mechanisms (Battey and 
Ohlrogge 1989; Prasad et al . , 1987). This is 
particularly intriguing since diacylglycerol is a 
precursor of both triacylglycerol and polar lipid. 
With castor microsomes, there is some evidence that 
the pool of ricinoleoyl-containing polar lipid is 
minimized by a preference of diacylglycerol 
acyltransf erase for ricinoleate-containing 
diacylglycerol s (Bafor et al . , 1991). Analyses of 
vegetative tissues have generated few reports of 
unusual fatty acids, other than those occurring in 
the cuticle. The cuticle contains various 
hydroxylated fatty acids which are interesterif ied 
to produce a high molecular weight polyester which 
serves a structural role. A small number of other 
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exceptions exist in which unusual fatty acids are 
found in tissues other than the seed. 

The biosynthesis of ricinoleic acid from 
oleic acid in the developing endosperm of castor 
5 (Ricinus communis) has been studied by a variety of 
methods. Morris (1967) established in double- 
labeling studies that hydroxylation occurs directly 
by hydroxyl substitution rather than via an 
unsaturated-, keto- or epoxy- intermediate . 

10 Hydroxylation using oleoyl-CoA as precursor can be 
demonstrated in crude preparations or microsomes, 
but activity in microsomes is unstable and variable, 
and isolation of the microsomes involved a 
considerable, or sometimes complete loss of activity 

15 (Galliard and Stumpf, 1966; Moreau and Stumpf, 
1981) . Oleic acid can replace oleoyl-CoA as a 
precursor, but only in the presence of CoA, Mg 2 * and 
ATP (Galliard and Stumpf, 1966) indicating that 
activation to the acyl-CoA is necessary. However, no 

20 radioactivity could be detected in ricinoleoyl -CoA 
(Moreau and Stumpf, 1981) . These and more recent 
observations (Bafor et al. f 1991) have been 
interpreted as evidence that the substrate for the 
castor oleate hydroxylase is oleic acid esterified 

25 to phosphatidylcholine or another phospholipid. 

The hydroxylase is sensitive to cyanide and 
azide, and dialysis against metal chelators reduces 
activity, which could be restored by addition of 
FeS0 4/ suggesting iron involvement in enzyme activity 

30 (Galliard and Stumpf, 1966). Ricinoleic acid 

synthesis requires molecular oxygen (Galliard and 
Stumpf, 1966; Moreau and Stumpf 1981) and requires 
NAD ( P ) H to reduce cytochrome b5 which is thought to 
be the intermediate electron donor for the 
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hydroxylase reaction (Smith et al . , 1992). Carbon 
monoxide does not inhibit hydroxylation, indicating 
that a cytochrome P4 5 0 is not involved (Galliard and 
Stumpf , 1966; Moreau and Stumpf 1981) . Data from a 
5 study of the substrate specificity of the 

hydroxylase show that all substrate parameters 
(i.e., chain length and double bond position with 
respect to both ends) are important; deviations in 
these parameters caused reduced activity relative to 

10 oleic acid (Howling et al . , 1972). The position at 
which the hydroxy 1 was introduced, however, was 
determined by the position of the double bond, 
always being three carbons distal. Thus, the castor 
acyl hydroxylase enzyme can produce a family of 

15 different hydroxylated fatty acids depending on the 
availability of substrates. Thus, as a matter of 
convenience, the enzyme is referred throughout this 
specification as a kappa hydroxylase (rather than an 
oleate hydroxylase) to indicate the broad substrate 

20 specificity. 

The castor kappa hydroxylase has many 
superficial similarities to the microsomal fatty 
acyl desaturases (Browse and Somerville, 1991) . In 
particular, plants have a microsomal oleate 

25 desaturase active at the A12 position. The substrate 
of this enzyme (Schmidt et al . , 1993) and of the 
hydroxylase (Bafor et al., 1991) appears to be a 
fatty acid esterified to the sn-2 position of 
phosphatidylcholine. When oleate is the substrate, 

3 0 the modification occurs at the same position (A12) 
in the carbon chain, and requires the same 
cof actors, namely electrons from NADH via cytochrome 
b 5 and molecular oxygen. Neither enzyme is inhibited 
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by carbon monoxide (Moreau and Stumpf , 1981) , the 
characteristic inhibitor of cytochrome P450 enzymes. 

There do not appear to have been any 
published biochemical studies of the properties of 
5 the hydroxylase enzyme (s) in Lesquerella. 

Concep tual basis of the invention 

The present inventors have described the use 
of a cDNA clone from castor for the production of 
ricinoleic acid in transgenic plants. As noted 

10 above, biochemical studies had suggested that the 
castor hydroxylase may not have strict specificity 
for oleic acid but would also catalyze hydroxylation 
of other fatty acids such as icosenoic acid 
(20:l cisA11 ) (Howling et al., 1972). Based on these 

15 studies, expression of kappa hydroxylase in 

transgenic plants of species such as Brassica napus 
and Arabidopsis thaliana that accumulate fatty acids 
such as icosenoic acid (20 : l ciaA11 ) and erucic acid 
(13-docosenoic acid; 22:l ciaA13 ) may cause the 

2 0 accumulation of hydroxylated derivatives of these 
fatty acids due to the activity of the hydroxylase 
on these fatty acids. Direct evidence is presented 
in Example 1 that hydroxlyated derivatives of 
ricinoleic, lesquerolic, densipolic and auricolic 

2 5 fatty acids are produced in transgenic Arabidopsis 
plants . 

Example 2 shows the isolation of a novel 
kappa hydroxylase gene from Lesquerella fendleri. 

In view of the high degree of sequence 
30 similarity between L12 fatty acid desaturases and 
the castor hydroxylase (van de Loo et al., 1995), 
the validity of claims (e.g., PCT WO 94/11516) for 
using a limited set of desaturase or hydroxylase 
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genes or sequences derived therefrom to identify 
genes of identical function from other species must 
be viewed with skepticism. In this application, the 
present inventors teach a method by which 
5 hydroxylase genes can be distinguished from 
desaturases . The present inventors describe a 
mechanistic basis for the similar reaction 
mechanisms of desaturases and hydroxylases. Briefly, 
the available evidence suggests that fatty acid 

10 desaturases have a similar reaction mechanism to the 
bacterial enzyme methane monooxygenase which 
catalyses a reaction involving oxygen- atom transfer 
(CH 4 -* CH 3 OH) (van de Loo et al., 1993) . The cofactor 
in the hydroxylase component of methane 

15 monooxygenase is termed a /z-oxo bridged diiron 

cluster (FeOFe) . The two iron atoms of the FeOFe 
cluster are liganded by protein-derived nitrogen or 
oxygen atoms, and are tightly redox- coupled by the 
covalently-bridging oxygen atom. The FeOFe cluster 

2 0 accepts two electrons, reducing it to the diferrous 
state, before oxygen binding. Upon oxygen binding, 
it is likely that heterolytic cleavage also occurs, 
leading to a high valent oxoiron reactive species 
that is stabilized by resonance rearrangements 

2 5 possible within the tightly coupled FeOFe cluster. 

The stabilized high-valent oxoiron state of methane 
monooxygenase is capable of proton extraction from 
methane, followed by oxygen transfer, giving 
methanol . The FeOFe cofactor has been shown to be 

3 0 directly relevant to plant fatty acid modifications 

by the demonstration that castor stearoyl-ACP 
desaturase contains this type of cofactor (Fox et 
al . , 1993) . 
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On the basis of the foregoing considerations, 
the present inventors suggest that the castor oleate 
hydroxylase might be a structurally modified fatty 
acyl desaturase, based upon three arguments. The 
first argument involves the taxonomic distribution 
of plants containing ricinoleic acid. Ricinoleic 
acid has been found in 12 genera of 10 families of 
higher plants (reviewed in van de Loo et al . , 1993). 
Thus, plants in which ricinoleic acid occurs are 
found throughout the plant kingdom, yet close 
relatives of these plants do not contain the unusual 
fatty acid. This pattern suggests that the ability 
to synthesize ricinoleic acid has arisen (and been 
lost) several times independently, and is therefore 
has recently diverged. In other words, the ability 
to synthesize ricinoleic acid has evolved rapidly, 
suggesting that a relatively minor genetic change in 
the structure of the ancestral enzyme was necessary 
to accomplish it. 

The second argument is that many biochemical 
properties of castor kappa hydroxylase are similar 
to those of the microsomal desaturases, as discussed 
above (e.g., both preferentially act on fatty acids 
esterified to the sn-2 position of 

phosphatidylcholine, both use cytochrome b5 as an 
intermediate electron donor, both are inhibited by 
cyanide, both require molecular oxygen as a 
substrate, both are thought to be located in the 
endoplasmic reticulum) . 

The third argument stems from the discussion 
of oxygenase cof actors above, in which it is 
suggested that the plant membrane bound fatty acid 
desaturases may have a /i-oxo bridged diiron cluster - 
type cof actor, and that such cof actors are capable 



of catalyzing both fatty acid desaturat ions and 
hydroxylations, depending upon the electronic and 
structural properties of the protein active site. 

Taking these three arguments together, the 
present inventors suggest that kappa hydroxylase of 
castor endosperm is homologous to the microsomal 
oleate A12 desaturase found in all plants. A number 
of genes encoding microsomal A12 desaturases from 
various species have recently been cloned (Okuley et 
al., 1994) and substantial information about the 
structure of these enzymes is now known (Shanklin et 
al., 1994}. Hence, in the following invention, the 
present inventors teach how to use structural 
information to isolate and identify kappa 
hydroxylase genes. This example teaches the method 
by which any carbon -monoxide insensitive plant fatty 
acyl hydroxylase gene can be identified by one 
skilled in the art. 

An unpredicted outcome of our studies on the 
castor hydroxylase gene in transgenic Arabidopsis 
plants was the discovery that expression of the 
hydroxylase leads to increased accumulation of oleic 
acid in seed lipids. Because of the low nucleotide 
sequence homology between the castor hydroxylase and 
the A12 -desaturase (about 67%) , it is unlikely that 
this effect is due to silencing (also called sense- 
suppression or cosuppression) of the expression of 
the desaturase gene by the hydroxylase gene. 
Whatever the basis for the effect, this invention 
teaches the use of hydroxylase genes to alter the 
level of fatty acid unsaturation in transgenic 
plants. This invention also teaches the use of 
genetically modified hydroxylase and desaturase 
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genes to achieve directed modification of fatty acid 
unsaturation levels. 

BRIEF DESCRIPTION OF THE DRAW TITOS 

Figures 1A-D show the mass spectra of hydroxy 
5 fatty acids standards (Figure 1A, O-TMS- 

methylricinoleate; Figure IB, O-TMS -methyl 
densipoleate; Figure 1C, O-TMS-methyl-lesqueroleate ; 
and Figure ID, O-TMS-methylauricoleate) . 

Figure 2 shows the fragmentation pattern of 
10 trimethylsilylated methyl esters of hydroxy fatty 
acids . 

Figure 3A shows the gas chromatogram of fatty 
acids extracted from seeds of wild type Arabidopsis 
plants. Figure 3B shows the gas chromatogram of 

15 fatty acids extracted from seeds of transgenic 

Arabidopsis plants containing the fahl2 hydroxylase 
gene . The numbers indicate the following fatty 
acids: [1] 16:0; [2] 18:0; [3] 18:lcisA9; [4] 
18:2 ci8A9 ' 12 ; [5] 20:0; [6] 20:l clsA11 ; [7] 1 8 : 3 ci8A9 ' 12 ' 15 ; 

20 [8] 20:2 ci * A11 ' 14 ; [9] 22:l CisA1 > ; [10] ricinoleic acid; 
[11] densipolic acid; [12] lesquerolic acid; and 
[13] auricolic acid. 

Figures 4A-D show the mass spectra of novel 
fatty acids found in seeds of transgenic plants. 

25 Figure 4A shows the mass spectrum of peak 10 from 

Figure 3B. Figure 4B shows the mass spectrum of peak 
11 from Figure 3B. Figure 4C shows the mass spectrum 
of peak 12 from Figure 3B. Figure 4D shows the mass 
spectrum of peak 13 from Figure 3B. 

3 0 Figure 5 shows the nucleotide sequence of 

pLesq2 (SEQ ID NO:l) . 

Figure 6 shows the nucleotide sequence of 
pLesq3 (SEQ ID NO: 2) . 
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Figure 7 shows a Northern blot of total RNA 
from seeds of L. fendleri probed with pLesq2 or 
pLesq3 . S, indicates RNA is from seeds; L, indicates 
RNA is from leaves. 
5 Figures 8A-B show the nucleotide sequence of 

genomic clone encoding pLesq-HYD (SEQ ID NO: 3), and 
the deduced amino acid sequence of hydroxylase 
enzyme encoded by the gene (SEQ ID NO: 4) . 

Figures 9A-B show multiple sequence alignment 

10 of deduced amino acid sequences for kappa 

hydroxylases and microsomal A12 desaturases. 
Abbreviations are: Rcfahl2, fahl2 hydroxylase gene 
from R . communis (van de Loo et al . , 1995); Lffahl2, 
kappa hydroxylase gene from L. fendleri; Atfad2, 

15 fad2 desaturase from Arabidopsis thaliana (Okuley et 
al., 1994); Gmfad2-1, fad2 desaturase from Glycine 
max (GenBank accession number L43920) ; Gmfad2-2, 
fad2 desaturase from Glycine max (Genbank accession 
number L43921) ; Zmf ad2 , fad2 desaturase from Zea 

20 mays (PCT WO 94/11516); Rcfad2, fragment of fad2 
desaturase from R . communis (PCT WO 94/11516) ; 
Bnfad2, fad2 desaturase from Brasaica napus (PCT WO 
94/11516); LFFAH1 2 . AMI , SEQ ID NO : 4 ; FAH12.AMI, SEQ 
ID NO: 5; ATFAD2 .AMI , SEQ ID NO : 6 ; BNFAD2 . AMI , SEQ ID 

25 N0:7; GMFAD2 - 1 . AMI , SEQ ID NO:8; GMFAD2 -2 . AMI , SEQ 
ID NO: 9; ZMFAD2 .AMI , SEQ ID NO: 10; and RCFAD2 . AMI , 
SEQ ID NO: 11. 

Figure 10 shows a Southern blot of genomic 
DNA from L. fendleri probed with pLesq-HYD . E = 

3 0 EcoRI, H = Hindi I I , X = Xbal . 

Figure 11 shows a map of binary Ti plasmid 
pSL*J44 024 . 

Figure 12 shows a map of plasmid pYES2.0 
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Figure 13 shows part of a gas chromatogram of 
derivatized fatty acids from yeast cells that 
contain plasmid pLesqYes in which expression of the 
hydroxylase gene was induced by addition of 
5 galactose to the growth medium. The arrow points to 
a peak that is not present in uninduced cells. The 
lower part of the figure is the mass spectrum of the 
peak indicated by the arrow. 

SUMMARY OF T HE INVENTION 

10 This invention relates to plant fatty acyl 

hydroxylases. Methods to use conserved amino acid or 
nucleotide sequences to obtain plant fatty acyl 
hydroxylases are described. Also described is the 
use of cDNA clones encoding a plant hydroxylase to 

15 produce a family of hydroxylated fatty acids in 
transgenic plants. 

In a first embodiment, this invention is 
directed to recombinant DNA constructs which can 
provide for the transcription, or transcription and 

20 translation (expression) of the plant kappa 

hydroxylase sequence. In particular, constructs 
which are capable of transcription, or transcription 
and translation in plant host cells are preferred. 
Such constructs may contain a variety of regulatory 

25 regions including transcriptional initiation regions 
obtained from genes preferentially expressed in 
plant seed tissue. In a second aspect, this 
invention relates to the presence of such constructs 
in host cells, especially plant host cells which 

3 0 have an expressed plant kappa hydroxylase therein. 

In yet another aspect, this invention relates 
to a method for producing a plant kappa hydroxylase 
in a host cell or progeny thereof via the expression 
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of a construct in the cell. Cells containing a plant 
kappa hydroxylase as a result of the production of 
the plant kappa hydroxylase encoding sequence are 
also contemplated herein. 
5 in another embodiment, this invention relates 

to methods of using a DNA sequence encoding a plant 
kappa hydroxylase for the modification of the 
proportion of hydroxylated fatty acids produced 
within a cell, especially plant cells. Plant cells 

10 having such a modified hydroxylated fatty acid 
composition are also contemplated herein. 

In a further aspect of this invention, plant 
kappa hydroxylase proteins and sequences which are 
related thereto, including amino acid and nucleic 

15 acid sequences, are contemplated. Plant kappa 
hydroxylase exemplified herein includes a 
Lesquerella. fendleri fatty acid hydroxylase. This 
exemplified fatty acid hydroxylase may be used to 
obtain other plant fatty acid hydroxylases of this 

2 0 invention. 

In a further aspect of this invention, a 
nucleic acid sequence which directs the seed 
specific expression of an associated polypeptide 
coding sequence is described. The use of this 

25 nucleic acid sequence or fragments derived 

therefrom, to obtain seed-specific expression in 
higher plants of any coding sequence is contemplated 
herein. 

In a further aspect of this invention, the 
30 use of genes encoding fatty acyl hydroxylases of 

this invention are used to alter the amount of fatty 
acid unsaturation of seed lipids. The present 
invention further discloses the use of genetically 
modified hydroxylase and desaturase genes to achieve 
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directed modification of fatty acid unsaturation 
levels . 

DETAILED DESCRIPTION OF THE INVENTION 

A genetically transformed plant of the 
5 present invention which accumulates hydroxylated 
fatty acids can be obtained by expressing the 
double- stranded DNA molecules described in this 
application . 

A plant fatty acid hydroxylase of this 
10 invention includes any sequence of amino acids, such 
as a protein, polypeptide or peptide fragment, or 
nucleic acid sequences encoding such polypeptides, 
obtainable from a plant source which demonstrates 
the ability to catalyze the production of 
15 ricinoleic, lesquerolic, hydroxyerucic (16- 
hydroxydocos - cis- 13 -enoic acid) or 

hydroxypalmit oleic ( 12 -hydroxyhexa dec- cis- 9 -enoic ) 
from CoA, ACP or lipid-linked monoenoic fatty acid 
substrates under plant enzyme reactive conditions. 
20 By "enzyme reactive conditions" is meant that any 

necessary conditions are available in an environment 
(i.e., such factors as temperature, pH, lack of 
inhibiting substances) which will permit the enzyme 
to function. 

25 Preferential activity of a plant fatty acid 

hydroxylase toward a particular fatty acyl substrate 
is determined upon comparison of hydroxylated fatty 
acid product amounts obtained per different fatty 
acyl substrates. For example, by "oleate preferring" 

3 0 is meant that the hydroxylase activity of the enzyme 
preparation demonstrates a preference for oleate- 
containing substrates over other substrates. 
Although the precise substrate of the castor fatty 
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acid hydroxylase is not known, it is thought to be a 
monounsaturated fatty acid moiety which is 
esterified to a phospholipid such as 
phosphatidylcholine. However, it is also possible 
5 that monounsaturated fatty acids esterified to 

phosphatidylethanolamine, phosphatidic acid or a 
neutral lipid such as diacylglycerol or a Coenzyme-A 
thioester may also be substrates. 

As noted above, significant activity has been 

10 observed in radioactive labelling studies using 

fatty acyl substrates other than oleate (Howling et 
al., 1972) indicating that the substrate specificity 
is for a family of related fatty acyl compounds . 
Because the castor hydroxylase introduces hydroxy 

15 groups three carbons from a double bond, proximal to 
the methyl carbon of the fatty acid, the enzyme is 
termed a kappa hydroxylase for convenience. Of 
particular interest, the present invention discloses 
that the castor kappa hydroxylase may be used for 

20 production of 12 -hydroxy- 9 -octadecenoic acid 

(ricinoleate) , 12 - hydroxy- 9 -hexadecenoic acid, 14- 
hydroxy- il-eicosenoic acid, 16 -hydroxy- 13 -docosenoic 
acid, 9 -hydroxy- 6 -octadecenoic acid by expression in 
plants species which produce the non-hydroxylated 

25 precursors. The present invention also discloses 

production of additionally modified fatty acids such 
as 12-hydroxy- 9 , 15 - octadecadienoic acid that result 
from desaturation cf hydroxylated fatty acids (e.g., 
12-hydroxy-9-octadecenoic acid in this example) . 

3 0 The present invention also discloses that 

future advances in the genetic engineering of plants 
will lead to production of substrate fatty acids, 
such as icosenoic acid esters, and palmitoleic acid 
esters in plants than do not normally accumulate 
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such fatty acids. The invention described herein may 
be used in conjunction with such future improvements 
to produce hydroxylated fatty acids of this 
invention in any plant species that is amenable to 
5 directed genetic modification. Thus, the 

applicability of this invention is not limited in 
our conception only to those species that currently 
accumulate suitable substrates. 

As noted above, a plant kappa hydroxylase of 

10 this invention will display activity towards various 
fatty acyl substrates. During biosynthesis of lipids 
in a plant cell, fatty acids are typically 
covalently bound to acyl carrier protein (ACP) , 
coenzyme A (CoA) or various cellular lipids. Plant 

15 kappa hydroxylases which display preferential 

activity toward lipid- linked acyl substrate are 
especially preferred because they are likely to be 
closely associated with normal pathway of storage 
lipid synthesis in immature embryos. However, 

20 activity toward acyl -CoA substrates or other 
synthetic substrates, for example, is also 
contemplated herein . 

Other plant kappa hydroxylases are obtainable 
from the specific exemplified sequences provided 

25 herein. Furthermore, it will be apparent that one 
can obtain natural and synthetic plant kappa 
hydroxylases including modified amino acid sequences 
and starting materials for synthetic-protein 
modeling from the exemplified plant kappa 

3 0 hydroxylase and from plant kappa hydroxylases which 
are obtained through the use of such exemplified 
sequences. Modified amino acid sequences include 
sequences which have been mutated, truncated, 
elongated or the like, whether such sequences were 
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partially or wholly synthesized. Sequences which are 
actually purified from plant preparations or are 
identical or encode identical proteins thereto, 
regardless of the method used to obtain the protein 
5 or sequence, are equally considered naturally 
derived . 

Thus, one skilled in the art will readily 
recognize that antibody preparations, nucleic acid 
probes (DNA and RNA) or the like may be prepared and 

10 used to screen and recover "homologous" or "related" 
kappa hydroxylases from a variety of plant sources. 
Typically, nucleic acid probes are labeled to allow 
detection, preferably with radioactivity although 
enzymes or other methods may also be used. For 

15 immunological screening methods, antibody 

preparations either monoclonal or polyclonal are 
utilized. Polyclonal antibodies, although less 
specific, typically are more useful in gene 
isolation. For detection, the antibody is labeled 

2C using radioactivity or any one of a variety of 

second antibody/enzyme conjugate systems that are 
commercially available . 

Homologous sequences are found when there is 
an identity of sequence and may be determined upon 

25 comparison of sequence information, nucleic acid or 
amino acid, or through hybridization reactions 
between a known kappa hydroxylase and a candidate 
source. Conservative changes, such as Glu/Asp, 
Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may also be 

3 0 considered in determining sequence homology. 

Typically, a lengthy nucleic acid sequence may show 
as little as 50-60% sequence identity, and more 
preferably at least about 70% sequence identity, 
between the target sequence and the given plant 
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kappa hydroxylase of interest excluding any 
deletions which may be present, and still be 
considered related. Amino acid sequences are 
considered homologous by as little as 25% sequence 
5 identity between the two complete mature proteins, 
(see generally, Doolittle, R.F., OF URFS and ORFS , 
University Science Books, CA, 1986.) 

A genomic or other appropriate library 
prepared from the candidate plant source of interest 

10 may be probed with conserved sequences from the 
plant kappa hydroxylase to identify homologously 
related sequences. Use of an entire cDNA or other 
sequence may be employed if shorter probe sequences 
are not identified. Positive clones are then 

15 analyzed by restriction enzyme digestion and/or 

sequencing. When a genomic library is used, one or 
more sequences may be identified providing both the 
coding region, as well as the transcriptional 
regulatory elements of the kappa hydroxylase gene 

2 0 from such plant source. Probes can also be 

considerably shorter than the entire sequence. 
Oligonucleotides may be used, for example, but 
should be at least about 10, preferably at least 
about 15, more preferably at least 20 nucleotides in 

25 length. When shorter length regions are used for 

comparison, a higher degree of sequence identity is 
required than for longer sequences. Shorter probes 
are often particularly useful for polymerase chain 
reactions (PCR) , especially when highly conserved 

30 sequences can be identified (see Gould et al . , 1989 
for examples of the use of PCR to isolate homologous 
genes from taxonomical ly diverse species). 

When longer nucleic acid fragments are 
employed (>100 bp) as probes, especially when using 
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complete or large cDNA sequences, one would screen 
with low stringencies (for example, 40-50°C below 
the melting temperature of the probe) in order to 
obtain signal from the target sample with 20-50% 
5 deviation, i.e., homologous sequences (Beltz et al . , 
1983) . 

In a preferred embodiment, a plant kappa 
hydroxylase of this invention will have at least 60% 
overall amino acid sequence similarity with the 

10 exemplified plant kappa hydroxylase. In particular, 
kappa hydroxylases which are obtainable from an 
amino acid or nucleic acid sequence of a castor or 
Lesquerella kappa hydroxylase are especially 
preferred. The plant kappa hydroxylases may have 

15 preferential activity toward longer or shorter chain 
fatty acyl substrates . Plant fatty acyl hydroxylases 
having oleate- 12 -hydroxylase activity and 
eicosenoate-14-hydroxylase activity are both 
considered homologously related proteins because of 

20 in v±tro evidence (Howling et al . , 1972), and 

evidence disclosed herein, that the castor kappa 
hydroxylase will act on both substrates . 
Hydroxylated fatty acids may be subject to further 
enzymatic modification by other enzymes which are 

25 normally present or are introduced by genetic 

engineering methods. For example, 14 -hydroxy- 11, 17- 
eicosadienoic acid, which is present in some 
Lesquerella. species (Smith, 1985), is thought to be 
produced by desaturation of 14 - hydroxy- 1 1 - e icoseno i c 

30 acid. 

Again, not only can gene clones and materials 
derived therefrom be used to identify homologous 
plant fatty acyl hydroxylases, but the resulting 
sequences obtained therefrom may also provide a 
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further method to obtain plant fatty acyl 
hydroxylases from other plant sources. In 
particular, PCR may be a useful technique to obtain 
related plant fatty acyl hydroxylases from sequence 
5 data provided herein. One skilled in the art will be 
able to design oligonucleotide probes based upon 
sequence comparisons or regions of typically highly 
conserved sequence. Of special interest are 
polymerase chain reaction primers based on the 

10 conserved regions of amino acid sequence between the 
castor kappa hydroxylase and the L. fendleri 
hydroxylase (SEQ ID N0:4) . Details relating to the 
design and methods for a PCR reaction using these 
probes are described more fully in the examples. 

15 It should also be noted that the fatty acyl 

hydroxylases of a variety of sources can be used to 
investigate fatty acid hydroxylat ion events in a 
wide variety of plant and in vivo applications. 
Because all plants synthesize fatty acids via a 

20 common metabolic pathway, the study and/or 

application of one plant fatty acid hydroxylase to a 
heterologous plant host may be readily achieved in a 
variety of species. 

Once the nucleic acid sequence is obtained, 

25 the transcription, or transcription and translation 
(expression) , of the plant fatty acyl hydroxylases 
in a host cell is desired to produce a ready source 
of the enzyme and/or modify the composition of fatty 
acids found therein in the form of free fatty acids, 

30 esters (particularly esterified to glycerolipids or 
as components of wax esters), estolides, or ethers. 
Other useful applications may be found when the host 
cell is a plant host cell, in vitro and in vivo. For 
example, by increasing the amount of an kappa 
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hydroxylase available to the plant, an increased 
percentage of ricinoleate or lesqueroleate (14- 
hydroxy-ll-eicosenoic acid) may be provided. 

Kappa Hydroxylase 
5 By this invention, a mechanism for the 

biosynthesis of ricinoleic acid in plants is 
demonstrated. Namely, that a specific plant kappa 
hydroxylase having preferential activity toward 
fatty acyl substrates is involved in the 

10 accumulation of hydroxylated fatty acids in at least 
some plant species. The use of the terms ricinoleate 
or ricinoleic acid (or lesqueroleate or lesquerolic 
acid, densipoleate etc.) is intended to include the 
free acids, the ACP and CoA esters, the salts of 

15 these acids, the glycerolipid esters (particularly 
the triacylglycerol esters), the wax esters, the 
estolides and the ether derivatives of these acids. 

The determination that plant fatty acyl 
hydroxylases are active in the in vivo production of 

2 0 hydroxylated fatty acids suggests several 

possibilities for plant enzyme sources. In fact, 
hydroxylated fatty acids are found in some natural 
plant species in abundance. For example, three 
hydroxy fatty acids related to ricinoleate occur in 
25 major amounts in seed oils from various Lescpjerella 
species. Of particular interest, lesquerolic acid is 
a 20 carbon homolog of ricinoleate with two 
additional carbons at the carboxyl end of the chain 
(Smith, 1985) . Other natural plant sources of 

3 0 hydroxylated fatty acids include but are not limited 

to seeds of the Linum genus, seeds of Wrightia. 
species, Lycopodium species, Strophanthus species, 



WO 97/30582 



PCTYUS97/02187 



24 



Convolvulaces species, Calendula, species and many 
others (van de Loo et al., 1993) . 

Plants having significant presence of 
ricinoleate or lesqueroleate or desaturated other or 
5 modified derivatives of these fatty acids are 

preferred candidates to obtain naturally-derived 
kappa hydroxylases. For example, Lesquerella 
densipila contains a diunsaturated 18 carbon fatty 
acid with a hydroxyl group (van de Loo et al. # 1993) 

10 that is thought to be produced by an enzyme that is 
closely related to the castor kappa hydroxylase, 
according to the theory on which this invention is 
based. In addition, a comparison between kappa 
hydroxylases and between plant fatty acyl 

15 hydroxylases which introduce hydroxyl groups at 

positions other than the 12 -carbon of oleate or the 
14 -carbon of lesqueroleate or on substrates other 
than oleic acid and icosenoic acid may yield 
insights for gene identification, protein modeling 

20 or other modifications as discussed above. 

Especially of interest are fatty acyl 
hydroxylases which demonstrate activity toward fatty 
acyl substrates other than oleate, or which 
introduce the hydroxyl group at a location other 

2 5 than the C12 carbon. As described above, other plant 

sources may also provide sources for these enzymes 
through the use of protein purification, nucleic 
acid probes, antibody preparations, protein 
modeling, or sequence comparisons, for example, and 

3 0 of special interest are the respective amino acid 

and nucleic acid sequences corresponding to such 
plant fatty acyl hydroxylases. Also, as previously 
described, once a nucleic acid sequence is obtained 
for the given plant hydroxylase, further plant 
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sequences may be compared and/or probed to obtain 
homologously related DNA sequences thereto and so 
on . 

Genetic Engineering Applications 
5 As is well known in the art, once a cDNA 

clone encoding a plant kappa hydroxylase is 
obtained, it may be used to obtain its corresponding 
genomic nucleic acid sequences thereto. 

The nucleic acid sequences which encode plant 

10 kappa hydroxylases may be used in various 

constructs , for example, as probes to obtain further 
sequences from the same or other species. 
Alternatively, these sequences may be used in 
conjunction with appropriate regulatory sequences to 

15 increase levels of the respective hydroxylase of 
interest in a host cell for the production of 
hydroxylated fatty acids or study of the enzyme in 
vitro or in vivo or to decrease or increase levels 
of the respective hydroxylase of interest for some 

20 applications when the host cell is a plant entity, 
including plant cells, plant parts (including but 
not limited to seeds, cuttings or tissues) and 
plants . 

A nucleic acid sequence encoding a plant 
2 5 kappa hydroxylase of this invention may include 
genomic, cDNA or mRNA sequence. By "encoding" is 
meant that the sequence corresponds to a particular 
amino acid sequence either in a sense or anti-sense 
orientation. By "recombinant" is meant that the 
30 sequence contains a genetically engineered 

modification through manipulation via mutagenesis, 
restriction enzymes, or the like. A cDNA sequence 
may or may not encode pre-processing sequences, such 
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as transit or signal peptide sequences. Transit or 
signal peptide sequences facilitate the delivery of 
the protein to a given organelle and are frequently- 
cleaved from the polypeptide upon entry into the 
5 organelle, releasing the "mature" sequence. The use 
of the precursor DNA sequence is preferred in plant 
cell expression cassettes. 

Furthermore, as discussed above the complete 
genomic sequence of the plant kappa hydroxylase may 
10 be obtained by the screening of a genomic library 
with a probe, such as a cDNA probe, and isolating 
those sequences which regulate expression in seed 
tissue . 

Once the desired plant kappa hydroxylase 

15 nucleic acid sequence is obtained, it may be 

manipulated in a variety of ways. Where the sequence 
involves non-coding flanking regions, the flanking 
regions may be subjected to resection, mutagenesis, 
etc. Thus, transitions, transversions , deletions, 

20 and insertions may be performed on the naturally 

occurring sequence. In addition, all or part of the 
sequence may be synthesized. In the structural gene, 
one or more codons may be modified to provide for a 
modified amino acid sequence, or one or more codon 

25 mutations may be introduced to provide for a 
convenient restriction site or other purpose 
involved with construction or expression. The 
structural gene may be further modified by employing 
synthetic adapters, linkers to introduce one or more 

30 convenient restriction sites, or the like. 

The nucleic acid or amino acid sequences 
encoding a plant kappa hydroxylase of this invention 
may be combined with other non-native, or 
"heterologous", sequences in a variety of ways. By 



WO 97/30582 



PCT/US97/02187 



27 



"heterologous" sequences is meant any sequence which 
is not naturally found joined to the plant kappa 
hydroxylase, including, for example, combination of 
nucleic acid sequences from the same plant which are 
5 not naturally found joined together. 

The DNA sequence encoding a plant kappa 
hydroxylase of this invention may be employed in 
conjunction with all or part of the gene sequences 
normally associated with the kappa hydroxylase. In 

10 its component parts, a DNA sequence encoding kappa 
hydroxylase is combined in a DNA construct having, 
in the 5' to 3 ' direction of transcription, a 
transcription initiation control region capable of 
promoting transcription and/or translation in a host 

15 cell, the DNA sequence encoding plant kappa 

hydroxylase and a transcription and/or translation 
termination region . 

Potential host cells include both prokaryotic 
and eukaryotic cells. A host cell may be unicellular 

20 or found in a multicellular differentiated or 
undifferentiated organism depending upon the 
intended use. Cells of this invention may be 
distinguished by having a plant kappa hydroxylase 
foreign to the wild-type cell present therein, for 

25 example, by having a recombinant nucleic acid 
construct encoding a plant kappa hydroxylase 
therein . 

Depending upon the host, the regulatory 
regions will vary, including regions from viral, 
3 0 plasmid or chromosomal genes, or the like. For 
expression in prokaryotic or eukaryotic 
microorganisms, particularly unicellular hosts, a 
wide variety of constitutive or regulatable 
promoters may be employed . Expression in a 
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microorganism can provide a ready source of the 
plant enzyme. Among transcriptional initiation 
regions which have been described are regions from 
bacterial and yeast hosts, such as E. coli, B . 
5 subtilis , Sac char omyces cerevisiae, including genes 
such as beta-galactosidase, T7 polymerase, trpE or 
the like. 

For the most part, the constructs will 
involve regulatory regions functional in plants 

10 which provide for modified production of plant kappa 
hydroxylase with resulting modification of the fatty 
acid composition. The open reading frame, coding for 
the plant kappa hydroxylase or functional fragment 
thereof will be joined at its 5' end to a 

15 transcription initiation regulatory region. Numerous 
transcription initiation regions are available which 
provide for a wide variety of constitutive or 
regulatable, e.g., inducible, transcription of the 
structural gene functions. 

2 0 Among transcriptional initiation regions used 

for plants are such regions associated with the 
structural genes such as for nopaline and mannopine 
synthases, or with napin, soybean /3-conglycinin , 
oleosin, 12S storage protein, the cauliflower mosaic 
25 virus 35S promoters or the like. The transcription/ 
translation initiation regions corresponding to such 
structural genes are found immediately 5' upstream 
to the respective start codons . 

In embodiments wherein the expression of the 

3 0 kappa hydroxylase protein is desired in a plant 

host, the use of all or part of the complete plant 
kappa hydroxylase gene is desired. If a different 
promoter is desired, such as a promoter native to 
the plant host of interest or a modified promoter, 
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i.e., having transcription initiation regions 
derived from one gene source and translation 
initiation regions derived from a different gene 
source or enhanced promoters, such as double 3 5S 
5 CaMV promoters, the sequences may be joined together 
using standard techniques. 

For such applications when 5' upstream non- 
coding regions are obtained from other genes 
regulated during seed maturation, those 

10 preferentially expressed in plant embryo tissue, 
such as transcription initiation control regions 
from the B . napus napin gene, or the Arabidopsis 12S 
storage protein, or soybean 0-conglycinin (Bray et 
al . , 1987) are desired. Transcription initiation 

15 regions which are preferentially expressed in seed 
tissue, i.e., which are undetectable in other plant 
parts, are considered desirable for fatty acid 
modifications in order to minimize any disruptive or 
adverse effects of the gene product . 

20 Regulatory transcript termination regions may 

be provided in DNA constructs of this invention as 
well . Transcript termination regions may be provided 
by the DNA sequence encoding the plant kappa 
hydroxylase or a convenient transcription 

25 termination region derived from a different gene 
source, for example, the transcript termination 
region which is naturally associated with the 
transcript initiation region. Where the transcript 
termination region is from a different gene source, 

3 0 it will contain at least about 0.5 kb, preferably 
about 1-3 kb of sequence 3' to the structural gene 
from which the termination region is derived. 

Plant expression or transcription constructs 
having a plant kappa hydroxylase as the DNA sequence 
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of interest for increased or decreased expression 
thereof may be employed with a wide variety of plant 
life, particularly, plant life involved in the 
production of vegetable oils for edible and 
5 industrial uses. Most especially preferred are 

temperate oilseed crops. Plants of interest include, 
but are not limited to rapeseed (canola and high 
erucic acid varieties) , Crambe, Brassica juncea, 
Brassica nigra, meadowfoam, flax, sunflower, 

10 saf flower, cotton, Cuphea, soybean, peanut, coconut 
and oil palms and corn. An important criterion in 
the selection of suitable plants for the 
introduction on the kappa hydroxylase is the 
presence in the host plant of a suitable substrate 

15 for the hydroxylase. Thus, for example, production 
of ricinoleic acid will be best accomplished in 
plants that normally have high levels of oleic acid 
in seed lipids. Similarly, production of lesquerolic 
acid will best be accomplished in plants that have 

20 high levels of icosenoic acid in seed lipids. 

Depending on the method for introducing the 
recombinant constructs into the host cell, other DNA 
sequences may be required. Importantly, this 
invention is applicable to dicotyledons and 

25 monocotyledons species alike and will be readily 

applicable to new and/or improved transformation and 
regulation techniques. The method of transformation 
is not critical to the current invention; various 
methods of plant transformation are currently 

3 0 available. As newer methods are available to 
transform crops, they may be directly applied 
hereunder. For example, many plant species naturally 
susceptible to Agrobac terium infection may be 
successfully transformed via tripartite or binary 
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vector methods of Agrobacterium mediated 
transformation. In addition, techniques of 
microinjection, DNA particle bombardment, 
electroporat ion have been developed which allow for 
5 the transformation of various monocot and dicot 
plant species . 

In developing the DNA construct, the various 
components of the construct or fragments thereof 
will normally be inserted into a convenient cloning 

10 vector which is capable of replication in a 

bacterial host, e.g., E . coli. Numerous vectors 
exist that have been described in the literature. 
After each cloning, the plasrnid may be isolated and 
subjected to further manipulation, such as 

15 restriction, insertion of new fragments, ligation, 
deletion, insertion, resection, etc., so as to 
tailor the components of the desired sequence. Once 
the construct has been completed, it may then be 
transferred to an appropriate vector for further 

20 manipulation in accordance with the manner of 
transformation of the host cell. 

Normally, included with the DNA construct 
will be a structural gene having the necessary 
regulatory regions for expression in a host and 

25 providing for selection of transformant cells. The 
gene may provide for resistance to a cytotoxic 
agent, e.g., antibiotic, heavy metal, toxin, etc., 
complementation providing prototropy to an 
auxotrophic host, viral immunity or the like. 

30 Depending upon the number of different host species 
the expression construct or components thereof are 
introduced, one or more markers may be employed, 
where different conditions for selection are used 
for the different hosts. 
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It is noted that the degeneracy of the DNA 
code provides that some codon substitutions are 
permissible of DNA sequences without any 
corresponding modification of the amino acid 
5 sequence. 

As mentioned above, the manner in which the 
DNA construct is introduced into the plant host is 
not critical to this invention. Any method which 
provides for efficient transformation may be 

10 employed. Various methods for plant cell 

transformation include the use of Ti- or Ri- 
plasmids , microinjection, electroporation, 
infiltration, imbibition, DNA particle bombardment, 
liposome fusion, DNA bombardment or the like. In 

15 many instances, it will be desirable to have the 
construct bordered on one or both sides of the T- 
DNA, particularly having the left and right borders, 
more particularly the right border. This is 
particularly useful when the construct uses A . 

2 0 tumefaciens or A. rhizogenes as a mode for 

transformation, although the T -DNA borders may find 
use with other modes of transformation. 

Where Agrobacterium is used for plant cell 
transformation, a vector may be used which may be 

2 5 introduced into the Agrobacterium host for 

homologous recombination with T-DNA or the Ti- or 
Ri-plasmid present in the Agrobacterium host. The 
Ti- or Ri-plasmid containing the T-DNA for 
recombination may be armed (capable of causing gall 

30 formation) or disarmed (incapable of causing gall) , 
the latter being permissible, so long as the vir 
genes are present in the transformed Agrobacterium 
host. The armed plasmid can give a mixture of normal 
plant cells and gall. 
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In some instances where Agrobacterium is used 
as the vehicle for transforming plant cells, the 
expression construct bordered by the T-DNA border (s) 
will be inserted into a broad host spectrum vector, 
5 there being broad host spectrum vectors described in 
the literature. Commonly used is pRK2 or derivatives 
thereof. See, for example, Ditta et al . (1980), 
which is incorporated herein by reference . Included 
with the expression construct and the T-DNA will be 

10 one or more markers, which allow for selection of 
transformed Agrobac terixim and transformed plant 
cells. A number of markers have been developed for 
use with plant cells, such as resistance to 
kanamycin, the aminoglycoside G418, hygromycin, or 

15 the like. The particular marker employed is not 

essential to this invention, one or another marker 
being preferred depending on the particular host and 
the manner of construction. 

For transformation of plant cells using 

20 Agrrojbacterium, explants may be combined and 

incubated with the transformed Agrobacterium for 
sufficient time for transformation, the bacteria 
killed, and the plant cells cultured in an 
appropriate selective medium. Once callus forms, 

25 shoot formation can be encouraged by employing the 
appropriate plant hormones in accordance with known 
methods and the shoots transferred to rooting medium 
for regeneration of plants. The plants may then be 
grown to seed and the seed used to establish 

3 0 repetitive generations and for isolation of 
vegetable oils. 



WO 97/30582 



PCT/US97/02187 



34 



Using Hydroxylase Genes to Alter the Activity of 
Fatty Acid Desaturases 

A widely acknowledged goal of current efforts 
to improve the nutritional quality of edible plant 
5 oils, or to facilitate industrial applications of 

plant oils, is to alter the level of desaturation of 
plant storage lipids (Topfer et al., 1995). In 
particular, in many crop species it is considered 
desirable to reduce the level of polyunsaturat ion of 

10 storage lipids and to increase the level of oleic 

acid. The precise amount of the various fatty acids 
in a particular plant oil varies with the intended 
application. Thus, it is desirable to have a robust 
method that will permit genetic manipulation of the 

15 level of unsaturation to any desired level. 

Substantial progress has recently been made 
in the isolation of genes encoding plant fatty acid 
desaturases (reviewed in Topfer et al . , 1995) . These 
genes have been introduced into various plant 

20 species and used to alter the level of fatty acid 

unsaturation in one of three ways. First, the genes 
can be placed under transcriptional control of a 
strong promoter so that the amount of the 
corresponding enzyme is increased. In some cases 

25 this leads to an increase in the amount of the fatty 
acid that is the product of the reaction catalyzed 
by the enzyme. For example, Arondel et al. (1992) 
increased the amount of linolenic acid (18:3) in 
tissues of transgenic Arabidopsis plants by placing 

30 the endoplasmic ret iculum- localized fad3 gene under 
transcriptional control of the strong constitutive 
cauliflower mosaic virus 35S promoter. 

A second method of using cloned genes to 
alter the level of fatty acid unsaturation is to 
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cause transcription of all or part of a gene in 
transgenic tissues so that the transcripts have an 
antisense orientation relative to the normal mode of 
transcription. This has been used by a number of 
5 laboratories to reduce the level of expression of 
one or more desaturase genes that have significant 
nucleotide sequence homology to the gene used in the 
construction of the antisense gene (reviewed in 
Topfer et al . ) . For instance, antisense repression 

10 of the oleate A12 -desaturase in transgenic rapeseed 
resulted in a strong increase in oleic acid content 
(cf., Topfer et al . , 1995). 

A third method for using cloned genes to 
alter fatty acid desaturation is to exploit the 

15 phenomenon of cosuppression or "gene-silencing" 
(Matzke et al . , 1995). Although the mechanisms 
responsible for gene silencing are not known in any 
detail, it has frequently been observed that in 
transgenic plants, expression of an introduced gene 

20 leads to inactivation of homologous endogenous 
genes . 

For example, high-level sense expression of 
the Arabidopsis fad8 gene, which encodes a 
chloroplast -local i zed Al 5 -desaturase , in transgenic 

25 Arabidopsis plants caused suppression of the 

endogenous copy of the fad8 gene and the homologous 
fad7 gene (which encodes an isozyme of the fad8 
gene) (Gibson et al., 1994). The fad7 and fad8 genes 
are only 76% identical at the nucleotide level. At 

30 the time of publication, this example represented 
the most divergent pair of plant genes for which 
cosuppression had been observed. 

In view of previous evidence concerning the 
relatively high ] eve '. o F nucleotide sequence 
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homology required to obtain cosuppression, it is not 
obvious to one skilled in the art that sense 
expression in transgenic plants of the castor fatty 
acyl hydroxylase of this invention would 
5 significantly alter the amount of unsaturation of 
storage lipids. 

However, the present inventors establish that 
fatty acyl hydroxylase genes can be used for this 
purpose as taught in Example 4 of this 

10 specification. Of particular importance, this 

invention teaches the use of fatty acyl hydroxylase 
genes to increase the proportion of oleic acid in 
transgenic plant tissues. The mechanism by which 
expression of the gene exerts this effect is not 

15 known but may be due to one of several possibilities 
which are elaborated upon in Example 4 . 

The invention now being generally described, 
it will be more readily understood by reference to 
the following examples which are included for 

2 0 purposes of illustration only and are not intended 

to limit the present invention. 

EXAMPLES 

In the experimental disclosure which follows, 
all temperatures are given in degrees centigrade 
25 (°C), weights are given in grams (g) , milligram (mg) 
or micrograms (/xg) , concentrations are given as 
molar (M) , millimolar (mM) or micromolar (fiM) and 
all volumes are given in liters (1), microliters 
(/xl) or milliliters (ml), unless otherwise 

3 0 indicated. 
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EXAMPLE 1 - PRODUCTION OF NOVEL HYDROXYLATED FATTY 

ACIDS IN ARABIDOPSIS T HAL I ANA 

Overview 

The kappa hydroxylase encoded by the fahl2 
5 gene from castor was used to produce ricinoleic 
acid, lesquerolic acid, densipolic acid and 
auricolic acid in transgenic Arabidopsis plants. 

Production of transgenic plants 

A variety of methods have been developed to 

10 insert a DNA sequence of interest into the genome of 
a plant host to obtain the transcription and 
translation of the sequence to effect phenotypic 
changes. The following methods represent only one of 
many equivalent means of producing transgenic plants 

15 and causing expression of the hydroxylase gene. 

Arabidopsls plants were transformed, by 
Agrobsicterium- mediated transformation, with the 
kappa hydroxylase encoded by the castor fahl2 gene 
on binary Ti plasmid pB6 . This plasmid has also been 

2 0 used to transform Nicotians, tabacum for the 

production of ricinoleic acid. 

Inoculums of AgroJbac teriu/n tuzne-faciens strain 
GV3101 containing binary Ti plasmid pB6 were plated 
on L-broth plates containing 50 /ig/ml kanamycin and 
25 incubated for 2 days at 30°C. Single colonies were 
used to inoculate large liquid cultures (L-broth 
medium with 50 mg/1 rifampicin, 110 mg/1 gentamycin 
and 200 mg/1 kanamycin) to be used for the 
transformation of Arabidopsis plants. 

3 0 Arabidopsis plants were transformed by the in 

planta transformation procedure essentially as 
described by Bechtold et al . (1993). Cells of A. 
zumefaciens GV3101(pB6) were harvested from liquid 



WO 97/30582 



PCT/US97/02187 



38 



cultures by centrifugation, then resuspended in 
infiltration medium at OD 600 = 0.8. Infiltration 
medium was Murashige and Skoog macro and 
micronutrient medium {Sigma Chemical Co., St. Louis, 
5 MO) containing 10 mg/1 6-benzylaminopurine and 5% 

glucose. Batches of 12-15 plants were grown for 3 to 
4 weeks in natural light at a mean daily temperature 
of approximately 25°C in 3 . 5 inch pots containing 
soil. The intact plants were immersed in the 

10 bacterial suspension then transferred to a vacuum 
chamber and placed under 60 0 mm of vacuum produced 
by a laboratory vacuum pump until tissues appeared 
uniformly water-soaked (approximately 10 rnin) . The 
planes were grown at 25 °C under continuous light 

15 (100 /imol m" 2 s" 1 irradiation in the 400 to 700 nm 

range) for four weeks. The seeds obtained from all 
the plants in a pot were harvested as one batch. The 
seeds were sterilized by sequential treatment for 2 
min with ethanol followed by 10 min in a mixture of 

20 household bleach (Chlorox) , water and Tween-80 (50%, 
50%, 0.05%) then rinsed thoroughly with sterile 
water. The seeds were plated at high density (2000 
to 4000 per plate) onto agar-solidif ied medium in 
100 mm petri plates containing 1/2 X Murashige and 

25 Skoog salts medium enriched with B5 vitamins (Sigma 
Chemical Co., St. Louis, MO) and containing 
kanamycin at 50 mg/1. After incubation for 48 h at 
4°C to stimulate germination, seedlings were grown 
for a period of seven days until transf ormants were 

30 clearly identifiable as healthy green seedlings 
against a background of chlorotic kanamycin- 
sensitive seedlings. The transf ormants were 
transferred to soil for two weeks before leaf tissue 
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could be used for DNA and lipid analysis. More than 
2 0 transformants were obtained. 

DNA was extracted from young leaves from 
transformants to verify the presence of an intact 
5 fahl2 gene. The presence of the transgene in a 

number of the putative transgenic lines was verified 
by using the polymerase chain reaction to amplify 
the insert from pB6 . The primers used were HF2 = 
GCTCTTTTGTGCGCTCATTC (SEQ ID NO: 12) and HR1 = 

10 CGG TAC CAGAAAAC GC CTTG (SEQ ID NO: 13), which were 
designed to allow the amplification of a 700 bp 
fragment. Approximately 100 ng of genomic DNA was 
added to a solution containing 25 pmol of each 
primer, 1.5 U Taq polymerase (Boehringer Manheim) , 

15 200 uM of dNTPs , 50 mM KCl , 10 mM Tris.Cl (pH 9), 
0.1% (v/v) Triton X-100, 1,5 mM MgCl 2 , 3% (v/v) 
formamide, to a final volume of 50 /xl. 
Amplifications conditions were: 4 min denaturation 
step at 94°C, followed by 30 cycles of 92°C for 1 

20 min, 55°C for 1 min, 72°C for 2 min. A final 

extension step closed the program at 72 °C for 5 min. 
Transformants could be positively identified after 
visualization of a characteristic 1 kb amplified 
fragment on an ethidium bromide stained agarose gel. 

25 All transgenic lines tested gave a PCR product of a 
size consistent with the expected genotype, 
confirming that the lines were, indeed, transgenic. 
All further experiments were done with three 
representative transgenic lines of the wild type 

30 designated as 1-3, 4D, 7-4 and one transgenic line 
of the fad2 mutant line JB12 . The transgenic JB12 
line was included in order to test whether the 
increased accumulation of oleic acid in this mutant 
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would have an effect on the amount of ricinoleic 
acid that accumulated in the transgenic plants. 

Analysis of transgenic plants 

Leaves and seeds from fahl2 transgenic 
5 Arabidopsis plants were analyzed for the presence of 
hydroxylated fatty acids using gas chromatography. 
Lipids were extracted from 100-200 mg leaf tissue or 
50 seeds. Fatty acid methyl esters (FAMES) were 
prepared by placing tissue in 1 . 5 ml of 1.0 M 

10 methanolic HC1 (Supelco Co.) in a 13 x 100 mm glass 
screw-cap tube capped with a teflon- lined cap and 
heated to 80°C for 2 hours. Upon cooling, 1 ml 
petroleum ether was added and the FAMES removed by 
aspirating off the ether phase which was then dried 

15 under a nitrogen stream in a glass tube. One hundred 
fil of N, O-bis (Trimethylsilyl) trif luoroacetamide 
(BSTFA; Pierce Chemical Co) and 200 fil acetonitrile 
was added to derivatize the hydroxyl groups. The 
reaction was carried out at 70°C for 15 min. The 

20 products were dried under nitrogen, redissolved in 
100 /il chloroform and transferred to a gas 
chromatograph vial. Two fil of each sample were 
analyzed on a SP2340 fused silica capillary column 
(30 m, 0.75 mm ID, 0.20 mm film, Supelco), using a 

2 5 Hewlett-Packard 5890 II series Gas Chromatograph. 

The samples were not split, the temperature program 
was 195°C for 18 min, increased to 230°C at 
25°C/min, held at 230°C for 5 min then down to I95°C 
at 25°C/min. , and flame ionization detectors were 

30 used. 

The chromatographic elution time of methyl 
esters and O-TMS derivatives of ricinoleic acid, 
lesquerolic acid and vjricolic acid was established 
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by GC-MS of lipid samples from seeds of L . fendleri 
and comparison to published chromatograms of fatty 
acids from this species (Carlson et al . , 1990) . A O- 
TMS-methyl-ricinoleate standard was prepared from 
5 ricinoleic acid obtained from Sigma Chemical Co {St, 
Louis, MO) . O-TMS-methyl-lesqueroleate and O-TMS- 
methyl -auricoleate standards were prepared from 
triacylglycerols purified from seeds of L. fendleri. 
The mass spectrum of O-TMS-methyl - ricinoleate , O- 

10 TMS -methyl -dens ipol eat e , 0-TMS -methyl - lesqueroleate , 
and O-TMS-methyl -auricoleate are shown in Figures 
1A-D , respectively. The structures of the 
characteristic ions produced during mass 
spectrometry of these derivatives are shown in 

15 Figure 2. 

Lipid extracted from transgenic tissues were 
analyzed by gas chromatography and mass spectrometry 
for the presence of hydroxylated fatty acids . As a 
matter of reference, the average fatty acid 

20 composition of leaves in Arabidopsis wild type and 
fad2 mutant lines was reported by Miquel and Browse 
(1992). Gas chromatograms of methylated and 
silylated fatty acids from seeds of wild type and a 
fah!2 transgenic wild type plant are shown in 

25 Figures 3A and 3B, respectively. The profiles are 

very similar except for the presence of three small 
but distinct peaks at 14.3, 15.9 and 18.9 minutes. A 
very small peak at 2 0.15 min was also evident. The 
elution time of the peaks at 14.3 and 18.9 min 

3 0 corresponded precisely to that of comparably 

prepared ricinoleic and lesquerolic standards, 
respectively. No significant differences were 
observed in lipid extracts from leaves or roots of 
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the wild type and the fahl2 transgenic wild type 
lines (Table 1) . 

Thus, in spite of the fact that the fahl2 
gene is expressed throughout the plant, effects on 
5 fatty acid composition was observed only in seed 
tissue. The present inventors have made a similar 
observation for transgenic fahl2 tobacco. 

Table 1. Fatty acid composition of lipids from 
transgenic and wild type Arabidopsis . The values are 
10 the means obtained from analysis of samples from 
three independent transgenic lines, or three 
independent samples of wild type and fad2 lines. 
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In order to confirm that the observed new 
peaks in the transgenic lines corresponded to 
derivatives of ricinoleic, lesquerolic, densipolic 
and auricolic acids, mass spectrometry was used. The 
5 fatty acid derivatives were resolved by gas 

chromatography as described above except that a 
Hewlett-Packard 5971 series mass selective detector 
was used in place of the flame ionization detector 
used in the previous experiment . The spectra of the 

10 four new peaks in Figure 3B (peak numbers 10, 11, 12 
and 13) are shown in Figures 4A-D, respectively. 
Comparison of the spectrum obtained for the 
standards with that obtained for the four peaks from 
the transgenic lines confirms the identity of the 

15 four new peaks. On the basis of the three 

characteristic peaks at M/Z 187, 270 and 299, peak 
10 is unambiguously identified as 0-TMS- 
methylricinoleate . On the basis of the three 
characteristic peaks at M/Z 185, 270 and 299, peak 

20 11 is unambiguously identified as O-TMS- 

methyldensipoleate . On the basis of the three 
characteristic peaks at M/Z 187, 298 and 327, peak 

12 is unambiguously identified as 0-TMS- 
methyllesqueroleate . On the basis of the three 

25 characteristic peaks at M/Z 185, 298 and 327, peak 

13 is unambiguously identified as 0-TMS- 
methylauricoleate . 

These resul t s unequivocally demonstrate the 
identity of the fahl2 cDNA as encoding a hydroxylase 
3 0 that hydroxylates both oleic acid to produce 

ricinoleic acid and also hydroxylates icosenoic acid 
to produce lesquerolic acid. These results also 
provide additional evidence that the hydroxylase can 
be functionally expressed in a heterologous plant 
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species in such a way that the enzyme is 
catalytically functional. These results also 
demonstrate that expression of this hydroxylase gene 
leads to accumulation of ricinoleic, lesquerolic , 
5 densipolic and auricolic acids in a plant species 

that does not normally accumulate hydroxylated fatty 
acids in extractable lipids. 

The present inventors expected to find 
lesquerolic acid in the transgenic plants based on 
10 the biochemical evidence suggesting broad substrate 
specificity of the kappa hydroxylase. By contrast, 
the accumulation of densipolic and auricolic acids 
was less predictable. Since Arabidopsis does not 
normally contain significant quantities of the non- 
15 hydroxylated precursors of these fatty acids which 
could serve as substrates for the hydroxylase, it 
appears that one or more of the three n-3 fatty acid 
desaturases known in Arabidopsis (e.g., fad3, f ad7 , 
fad8; reviewed in Gibson et al., 1995) are capable 
20 of desaturating the hydroxylated compounds at the n- 
3 position. That is, densipolic acid is produced by 
the action of an n-3 desaturase on ricinoleic acid. 
Auricolic acid is produced by the action of an n-3 
desaturase on lesquerolic acid. Because it is 
25 located in the endoplasmic reticulum, the fad3 

desaturase is almost certainly responsible. This can 
be tested in the future by producing fahl2- 
containing transgenic plants of the fad3 -deficient 
mutant of Arabidopsis (similar experiments can be 
30 done with fad7 and fad8) . It is also formally 

possible that the enzymes that normally elongate 
18:l cisA9 to 20:l cisa11 may elongate 120H- 18 : 1 C1SA9 to 
14OH-20 : l cisA11 , and 120H - 1 8 : 2 C1SA9 ' 15 to 140H- 2 0 : 2 ctaA11 ' 17 . 



WO 97/30582 



PCTYUS97/02187 



47 



The amount of the various fatty acids in 
seed, leaf and root lipids of the control and 
transgenic plants is also presented in Table 1. 
Although the amount of hydroxylated fatty acids 
5 produced in this example is less than desired for 
production of ricinoleate and other hydroxylated 
fatty acids from plants, numerous improvements may 
be envisioned that will increase the level of 
accumulation of hydroxylated fatty acids in plants 

10 that express the fahl2 or related hydroxylase genes. 
Improvements in the level and tissue specificity of 
expression of the hydroxylase gene are envisioned. 
Methods to accomplish this by the use of strong, 
seed-specific promoters such as the B . napus napin 

15 promoter will be obvious to one skilled in the art. 
Additional improvements are envisioned that involve 
modification of the enzymes which cleave 
hydroxylated fatty acids from phosphatidylcholine, 
reduction in the activities of enzymes which degrade 

2 0 hydroxylated fatty acids and replacement of 

acyltransf erases which transfer hydroxylated fatty 
acids to the sn-1, sn-2 and sn-3 positions of 
glycerolipids . Although genes for these enzymes have 
not been described in the scientific literature, 

25 their utility in improving the level of production 
of hydroxylated fatty acids can be readily 
appreciated based on the results of biochemical 
investigations of ricinoleate synthesis . 

Although Arabidopsis is not an economically 

30 important plant species, it is widely accepted by 
plant biologists as a model for higher plants. 
Therefore, the inclusion of this example is intended 
to demonstrate the general utility of the invention 
described here to the modification of oil 
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composition in higher plants. One advantage of 
studying the expression of this novel gene in 
Arabldopsis is the existence in this system of a 
large body of knowledge on lipid metabolism, as well 
5 as the availability of a collection of mutants which 
can be used to provide useful information on the 
biochemistry of fatty acid hydroxylat ion in plant 
species. Another advantage is the ease of 
transposing any of the information obtained on 

10 metabolism of ricinoleate in Arabldopsis to closely 
related species such as the crop plants Brassica 
napus , Brassica juncea or Crambe abyss inica in order 
to mass produce ricinoleate, lesqueroleate or other 
hydroxylated fatty acids for industrial use. The 

15 kappa hydroxylase is useful for the production of 
ricinoleate or lesqueroleate in any plant species 
that accumulates significant levels of the 
precursors, oleic acid and icosenoic acid. Of 
particular interest are genetically modified 

20 varieties that accumulate high levels of oleic acid. 
Such varieties are currently available for sunflower 
and canola. Production of lesquerolic acid and 
related hydroxy fatty acids can be achieved in 
species that accumulate high levels of icosenoic 

25 acid or other long chain monoenoic acids. Such 
plants may in the future be produced by genetic 
engineering of plants that do not normally make such 
precursors. Thus, the use of the kappa hydroxylase 
will be of general utility. 
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EXAMPLE 2 a ISOLATION OF LESO UERELLA KAPPA 

HYDROXYLASE GENOMIC CLONE 

Overview 

Regions of nucleotide sequence that were 
5 conserved in both the castor kappa hydroxylase and 
the Arabidopsis fad2 A12 fatty acid desaturase were 
used to design oligonucleotide primers. These were 
used with genomic DNA from Lesquerella fendleri to 
amplify fragments of several homologous genes. These 

10 amplified fragments were then used as hybridization 
probes to identify full length genomic clones from a 
genomic library of L. fendleri . 

Hydroxylated fatty acids are specific to the 
seed tissue of Lesquerella sp., and are not found to 

15 any appreciable extent in vegetative tissues. One of 
the two genes identified by this method was 
expressed in both leaves and developing seeds and is 
therefore thought to correspond to the A12 fatty 
acid desaturase. The other gene was expressed at 

20 high levels in developing seeds but was not 

expressed or was expressed at very low levels in 
leaves and is the kappa hydroxylase from this 
species. The identity of the gene as a fatty acyl 
hydroxylase was established by functional expression 

25 of the gene in yeast. 

The identity of this gene will also be 
established by introducing the gene into transgenic 
Arabidopsis plants and showing that it causes the 
accumulation of ricinoleic acid, lesquerolic acid, 

30 densipolic acid and auricolic acid in seed lipids. 

The various steps involved in this process 
are described in detail below. Unless otherwise 
indicated, routine methods for manipulating nucleic 
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acids, bacteria and phage were as described by 
Sambrook et al . (198 9) . 

Isolation of a fragment of the Le squere 1 la kap pa 
hydroxylase gene 
5 Oligonucleotide primers for the amplification 

of the L. fendleri kappa hydroxylase were designed 
by choosing regions of high deduced amino acid 
sequence homology between the castor kappa 
hydroxylase and the Arabidopsis A12 desaturase 

10 (fad2) . Because most amino acids are encoded by 
several different codons, these oligonucleotides 
were designed to encode all possible codons that 
could encode the corresponding amino acids. 

The sequence of these mixed oligonucleotides 

15 was Oligo 1: TAYWSNCAYMGNMGNCAYCA (SEQ ID NO: 14) and 
Oligo 2: RTGRTGNGCNACRTGNGTRTC (SEQ ID NO: 15) where 
Y = C+T, W = A+T, S = G+C, N = A+G+C+T, M = A+C, and 
R = A+G. 

These oligonucleotides were used to amplify a 
20 fragment of DNA from L . fendleri genomic DNA by the 
polymerase chain reaction (PCR) using the following 
conditions: Approximately 100 ng of genomic DNA was 
added to a solution containing 2 5 pmol of each 
primer, 1.5 U Taq polymerase (Boehringer Manheim) , 
25 200 uM of dNTPs , 50 mM KCl , 10 mM Tris.Cl (pH 9), 
0.1% (v/v) Triton X-100, 1 . 5 mM MgCl 2 , 3% (v/v) 
formamide, to a final volume of 50 /il . 
Amplifications conditions were: 4 min denaturation 
step at 94°C, followed by 30 cycles of 92°C for 1 
30 min, 55°C for 1 min, 72°C for 2 min. A final 

extension step closed the program at 72°C for 5 min. 

PCR products c* approximately 54 0 bp were 
observed following 'e>crrophoretic separation of the 
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products of the PCR reaction in agarose gels. Two of 
these fragments were cloned into pBluescript 

(Stratagene) to give rise to plasmids pLesq2 and 
pLesq3 . The sequence of the inserts in these two 
5 plasmids was determined by the chain termination 
method. The sequence of the insert in pLesq2 is 
presented as Figure 5 (SEQ ID NO:l) and the sequence 
of the insert in pLesq3 is presented as Figure 6 

(SEQ ID NO: 2) . The high degree of sequence identity 
10 between the two clones indicated that they were both 
potential candidates to be either a A12 desaturase 
or a kappa hydroxylase . 

Northern analysis 

In L. fendleri , hydroxylated fatty acids are 

15 found in large amounts in seed oils but are not 

found in appreciable amounts in leaves . An important 
criterion in discriminating between a fatty acyl 
desaturase and kappa hydroxylase is that the kappa 
hydroxylase gene is expected to be expressed more 

20 highly in tissues which have high level of 

hydroxylated fatty acids than in other tissues. In 
contrast, all plant tissues should contain mRNA for 
an uj6 fatty acyl desaturase since diunsaturated 
fatty acids are found in the lipids of all tissues 

25 in most or all plants. 

Therefore, it was of great interest to 
determine whether the gene corresponding to pLesq2 
was also expressed only in seeds, or is also 
expressed in other tissues. This question was 

3 0 addressed by testing for hybridization of pLesq2 to 
RNA purified from developing seeds and from leaves. 

Total RNA was purified from developing seeds 
and young leaves of L . fendleri using an Rneasy RNA 
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extraction kit (Qiagen) , according to the 
manufacturer's instructions. RNA concentrations were 
quantified by UV spectrophotometry at X=260 and 280 
nm. In order to ensure even loading of the gel to be 
5 used for Northern blotting, RNA concentrations were 
further adjusted after recording fluorescence under 
UV light of RNA samples stained with ethidium 
bromide and run on a test denaturing gel. 

Total RNA prepared as described above from 

10 leaves and developing seeds was electrophoresed 

through an agarose gel containing formaldehyde (Iba 
et al . , 1993) . An equal quantity (10 /ig) of RNA was 
loaded in both lanes, and RNA standards (0.16-1.77 
kb ladder, Gibco-BRL) were loaded in a third lane. 

15 Following electrophoresis, RNA was transferred from 
the gel to a nylon membrane (Hybond N+, Amersham) 
and fixed to the filter by exposure to UV light. 

A 32 P- labelled probe was prepared from insert 
DNA of clone pLesq2 by random priming and hybridized 

20 to the membrane overnight at 52°C, after it had been 
prehybridized for 2 h. The prehybridizat ion solution 
contained 5X SSC, 10X Denhardt's solution, 0.1% SDS, 
0 . 1M KP0 4 pH 6.8, 100 fig /ml salmon sperm DNA. The 
hybridization solution had the same basic 

25 composition, but no SDS, and it contained 10% 

dextran sulfate and 30% f ormamide . The blot was 
washed once in 2X SSC, 0.5% SDS at 65°C then in IX 
SSC at the same temperature. 

Brief (30 min) exposure of the blot to X-ray 

30 film revealed that the probe pLesq2 hybridized to a 
single band only in the seed RNA lane (Figure 7) . 
The blot was re-probed with the insert from pLesq3 
gene, which gave bands of similar intensity in the 
seed and leaf lanes 1 Figure 7) . 
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These results show that the gene 
corresponding to the clone pLesq2 is highly and 
specifically expressed in seed of L . fendleri. In 
conjunction with knowledge of the nucleotide and 
5 deduced amino acid sequence, strong seed-specific 
expression of the gene corresponding to the insert 
in pLesq2 is a convincing indicator of the role of 
the enzyme in synthesis of hydroxylated fatty acids 
in the seed oil. 

10 Characterization of a genomic clone of the kappa 
hydroxylase 

Genomic DNA was prepared from young leaves of 
L . fendleri as described by Murray and Thompson 
(1980) . A Sau3AI -partial digest genomic library 

15 constructed in the vector XDashll (Stratagene, 11011 
North Torrey Pines Road, La Jolla CA 92037) was 
prepared by partially digesting 500 /xg of DNA, size- 
selecting the DNA on a sucrose gradient (Sambrook et 
al . , 1989), and ligating the DNA (12 kb average 

2 0 size) to the BamHI -digested arms of XDashll. The 
entire ligation was packaged according to the 
manufacturer's conditions and plated on E. coli 
strain XLl-Blue MRA-P2 (Stratagene) . This yielded 
5x10 s primary recombinant clones. The library was 

25 then amplified according to the manufacturer's 

conditions. A fraction of the genomic library was 
plated on E . coli XLl-Blue and resulting plaques 
(150,000) were lifted to charged nylon membranes 
(Hybond N+ , Amersham) , according to the 

30 manufacturer's conditions. DNA was crosslinked to 

the filters under UV in a Stratalinker (Stratagene) . 

Several clones carrying genomic sequences 
corresponding to the L . fendleri hydroxylase were 
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isolated by probing the membranes with the insert 
from pLesq2 that was PCR-amplif ied with internal 
primers and labelled with 32 P by random priming. The 
filters were prehybridized for 2 hours at 65°C in 7% 
5 SDS, ImM EDTA, 0.25 M Na 2 HP0 4 (pH 7.2), 1% BSA and 
hybridized to the probe for 16 hours in the same 
solution. The filters were sequentially washed at 
65°C in solutions containing 2 X SSC, 1 X SSC , 0.5 X 
SSC in addition to 0.1 % SDS. A 2.6 kb Xbal fragment 

10 containing the complete coding sequence for the 

kappa hydroxylase and approximately 1 kb of the 5' 
upstream region was subcloned into the corresponding 
site of pBluescript KS to produce plasmid pLesq-Hyd 
and the sequence determined completely using an 

15 automatic sequencer by the dideoxy chain termination 
method. Sequence data was analyzed using the program 
DNAS1S (Hitachi Company) . 

The sequence of the insert in clone pLesq-Hyd 
is shown in Figures 8A-B . The sequence entails 18 55 

2 0 bp of contiguous DNA sequence {SEQ ID NO : 3 ) . The 

clone encodes a 401 bp 5' untranslated region (i.e., 
nucleotides preceding the first ATG codon) , an 1152 
bp open reading frame, and a 3 02 bp 3' untranslated 
region. The open reading frame encodes a 384 amino 

25 acid protein with a predicted molecular weight of 
44,370 (SEQ ID NO : 4 ) . The amino terminus lacks 
features of a typical signal peptide (von Heijne, 
1985) . 

The exact translation- initiation methionine 
30 has not been experimentally determined, but on the 

basis of deduced amino acid sequence homology to the 
castor kappa hydroxylase (noted below) is thought to 
be the methionine encoded by the first ATG codon at 
nucleotide 402. 
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Comparison of the pLesq-Hyd deduced amino 
acid sequence with sequences of membrane -bound 
desaturases and the castor hydroxylase (Figures 9A- 
B) indicates that pLesq-Hyd is homologous to these 
5 genes. This figure shows an alignment of the L. 

fendleri hydroxylase (SEQ ID NO: 4) with the castor 
hydroxylase (van de Loo et al . , 1995), the 
Arabidopsis fad2 cDNA which encodes an endoplasmic 
reticulum-localized A12 desaturase (called fad2) 
10 (Okuley et al . , 1994), two soybean fad2 desaturase 

clones, a Brassica napus fad2 clone, a Zea ways fad2 
clone and partial sequence of a J?, communis fad2 
clone . 

The high degree of sequence homology 
15 indicates that the gene products are of similar 

function. For instance, the overall homology between 
the Lesquerella hydroxylase and the Arabidopsis fad2 
desaturase was 92.2% similarity and 84.8% identity 
and the two sequences differed in length by only one 
2 0 amino acid. 

Southern hybridization 

Southern analysis was used to examine the 
copy number of the genes in the L . fendleri genome 
corresponding to the clone pLesq-Hyd. Genomic DNA (5 

25 tig) was digested with EcoRI , HindJII and Xbal and 
separated on a 0.9% agarose gel. DNA was alkali- 
blotted to a charged nylon membrane (Hybond N+, 
Amersham) , according to the manufacturer's protocol. 
The blot was prehybridized for 2 hours at 65 °C in 7% 

30 SDS, ImM EDTA, 0.25 M Na 2 HP0 4 (pH 7.2), 1% BSA and 
hybridized to the probe for 16 hours in the same 
solution with pLesq-Hyd insert PCR-amplif ied with 
internal primers and labelled with 32 P by random 
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priming. The filters were sequentially washed at 
65°C in solutions containing 2 X SSC, 1 X SSC, 0.5 X 
SSC in addition to 0 . 1 % SDS , then exposed to X-ray 
film. 

5 The probe hybridized with a single band in 

each digest of L . fendleri DNA {Figure 10) , 
indicating that the gene from which pLesq-Hyd was 
transcribed is present in a single copy in the L. 
f endl eri genome . 

10 Expression of pLesq-Hyd in Transgenic Plants 

There are a wide variety of plant promoter 
sequences which may be used to cause tissue-specific 
expression of cloned genes in transgenic plants. For 
instance, the napin promoter and the acyl carrier 

15 protein promoters have previously been used in the 
modification of seed oil composition by expression 
of an antisense form of a desaturase (Knutson et 
al . , 1992) . Similarly, the promoter for the /S- 
subunit of soybean /?-conglycinin has been shown to 

20 be highly active and to result in tissue - specif ic 
expression in transgenic plants of species other 
than soybean (Bray et al . , 1987). Thus, other 
promoters which lead to seed- specif ic expression may 
also be employed for the production of modified seed 

25 oil composition. Such modifications of the invention 
described here will be obvious to one skilled in the 
art . 

Constructs for expression of L. fendleri 
kappa hydroxylase in plant cells are prepared as 
30 follows: A 13 kb Sail fragment containing the pLesq- 
Hyg gene was ligated into the Xhol site of binary Ti 
plasmid vector pSLJ44026 (Jones et al., 1992) 
(Figure 11) to produce plasmid pTi-Hyd and 
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transformed into Agrobacterium tumefaciens strains 
GV3101 by elect roporat ion. Strain GV3101 (Koncz and 
Schell, 1986) contains a disarmed Ti plasmid. Cells 
for electroporation were prepared as follows. GV3101 
5 was grown in LB medium with reduced NaCl (5 g/1) . A 
250 ml culture was grown to OD 600 = 0 . 6 , then 
centrifuged at 4000 rpm (Sorvall GS-A rotor) for 15 
min. The supernatant was aspirated immediately from 
the loose pellet, which was gently resuspended in 

10 500 ml ice-cold water. The cells were centrifuged as 
before, resuspended in 30 ml ice-cold water, 
transferred to a 30 ml tube and centrifuged at 5000 
rpm (Sorvall SS-34 rotor) for 5 min. This was 
repeated three times, resuspending the cells 

15 consecutively in 30 ml ice-cold water, 30 ml ice- 
cold 10% glycerol, and finally in 0.75 ml ice-cold 
10% glycerol. These cells were aliquoted, frozen in 
liquid nitrogen, and stored at -80°C. 

Electroporations employed a Biorad Gene 

2 0 Pulser instrument using cold 2 mm-gap cuvettes 

containing 40 fil cells and 1 jxl of DNA in water, at 
a voltage of 2.5 KV, and 200 Ohms resistance. The 
electroporated cells were diluted with 1 ml SOC 
medium (Sambrook et al . , 1989, page A2 ) and 
25 incubated at 28 °C for 2-4 h before plating on medium 
containing kanamycin (50 rng/1) . 

Arabidopsis thaliana can be transformed with 
the Agrobacterium cells containing pTi-Hyd as 
described in Example 1 above. Similarly, the 

3 0 presence of hydroxylated fatty acids in the 

transgeneic Arabidopsis plants can be demonstrated 
by the methods described in Example 1 above. 
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Constitutive expression of the L . fendleri 
hydroxylase in transgenic plants 

A 1.5 kb EcoRI fragment from pLesq-Hyg 
comprising the entire coding region of the 
5 hydroxylase was gel purified, then cloned into the 
corresponding site of pBluescript KS (Stratagene) . 
Plasmid DNA from a number of recombinant clones was 
then restricted with Pert I, which should cut only 
once in the insert and once in the vector polylinker 

10 sequence. Release of a 920 bp fragment with PstI 
indicated the right orientation of the insert for 
further manipulations. DNA from one such clone was 
further restricted with Sal I , the 5 ' ove r hangs 
filled- in with the Klenow fragment of DNA polymerase 

15 I, then cut with Sad . The insert fragment was gel 

purified, and cloned between the Smal and SacI sites 
of pBI121 (Clontech) behind the cauliflower mosaic 
virus 35S promoter. After checking that the sequence 
of the junction between insert and vector DNA was 

20 appropriate, plasmid DNA from a recombinant clone 
was used to transform A . tumefaciens (GV3101) . 
Kanamycin resistant colonies were then used for in 
planta transformation of A. thaliana as previously 
described . 

25 DNA was extracted from kanamycin resistant 

seedlings and used to PCR-amplify selected fragments 
from the hydroxylase using nested primers. When 
fragments of the expected size could be amplified, 
corresponding plants were grown in the greenhouse or 

30 on agar plates, and fatty acids extracted from fully 
expanded leaves, roots and dry seeds. GC-MS analysis 
was then performed as previously described to 
characterize the different fatty acid species and 
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detect accumulation of hydroxy fatty acids in 
transgenic tissues . 

Expression of the Lesauerella hydroxylase in yeast 
In order to demonstrate that the cloned L . 
5 fendleri gene encoded a kappa hydroxylase, the gene 
was expressed in yeast cells under transcriptional 
control of an inducible promoter and the yeast cells 
were examined for the presence of hydroxylated fatty 
acids by GC-MS. 

10 In a first step, a lambda genomic clone 

containing the L, fendleri hydroxylase gene was cut 
with EcoRI , and a resulting 1400 bp fragment 
containing the coding sequence of the hydroxylase 
gene was subcloned in the EcoRI site of the 

15 pBluescript KS vector (Stratagene) . This subclone, 
pLesqcod, contains the coding region of the 
Lesquerella hydroxylase plus some additional 3' 
sequence . 

In a second step, pLesqcod was cut with 

2 0 Hindi 1 1 and XJbal , and the insert fragment was cloned 

into the corresponding sites of the yeast expression 
vector pYes2 (Invitrogen; Figure 12). This subclone, 
pLesqYes, contains the L . fendleri hydroxylase in 
the sense orientation relative to the 3' side of the 
25 Gall promoter. This promoter is inducible by the 

addition of galactose to the growth medium, and is 
repressed upon addition of glucose. In addition, the 
vector carries origins of replication allowing the 
propagation of pLesqYes in both yeast and E. coli. 

3 0 Transformation of S. cerevisiae host strain CGY2557 

Yeast strain CGY2557 {MATot , GAL*, ura.3-52, 
leu2-3, trpl, ade2 - 1 , lys2-l, his5, ca.nl - 100) was 
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grown overnight at 28°C in YPD liquid medium (10 g 
yeast extract, 20 g bacto-peptone, 20 g dextrose per 
liter) , and an aliquot of the culture was inoculated 
into 100 ml fresh YPD medium and grown until the 
5 OD 600 of the culture was 1. Cells were then collected 
by centrif ugation and resuspended in about 200^x1 of 
supernatant. 40^x1 aliquots of the cell suspension 
were then mixed with 1-2/ig DNA and elect roporated in 
2 mm-gap cuvettes using a Biorad Gene Pulser 

10 instrument set at 600 V, 200 Q , 25 /iF, 160/xl YPD was 
added and the cells were plated on selective medium 
containing glucose. Selective medium consisted of 
6.7 g yeast nitrogen base (Difco) , 0.4 g casamino 
acids (Difco), 0.02 g adenine sulfate, 0.03 g L- 

15 leucine, 0.02 g L- tryptophan, 0.03 g L- lysine -HC1 , 
0.03 g L-histidine-HCl , 2% glucose, water to 1 
liter. Plates were solidified using 1.5% Difco 
Bacto-agar. Transformant colonies appeared after 3 
to 4 days incubation at 28°C. 

2 0 Expression of the L. fendleri hydroxylase in yeast 
Independent transformant colonies from the 

previous experiment were used to inoculate 5 ml of 

selective medium containing either 2% glucose (gene 

repressed) or 2% galactose (gene induced) as the 
25 sole carbon source. Independent colonies of CGY2557 

transformed with pYES2 containing no insert were 

used as controls. 

After 2 days of growth at 28°C, an aliquot of 

the cultures was used to inoculate 5 ml of fresh 
30 selective medium. The new culture was placed at 16°C 

and grown for 9 days. 
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Fatty acid analysis of yeast expressing the L . 
f endl eri hydroxy 1 a se 

Cells from 2.5 ml of culture were pelleted at 
18 0 0g, and the supernatant was aspirated as 
5 completely as possible. Pellets were then dispersed 
in 1 ml of 1 N methanolic HCl (Supelco, Bellafonte, 
PA) . Transmethylation and derivat izat ion of hydroxy 
fatty acids were performed as described above. After 
drying under nitrogen, samples were redissolved in 

10 50/xl chloroform before being analyzed by GC-MS. 

Samples were injected into an SP2330 fused-silica 
capillary column (30 m x 0.25 mm ID, 0.25/xm film 
thickness, Supelco). The temperature profile was 100 
- 160°C, 25°C/min / 160 - 230°C, 10°C/min / 230°C / 3 

15 min, 230-100°C, 25°C/min. Flow rate was 0.9 ml /min. 
Fatty acids were analyzed using a Hewlett-Packard 
5971 series Msdetector. 

Gas chromatograms of derivatized fatty acid 
methyl esters from induced cultures of yeast 

20 containing pLesqYes contained a novel peak that 
eluted at 7.6 min (Figure 13). 0-TMS methyl 
ricinoleate eluted at exactly the same position on 
control chromatograms. This peak was not present in 
cultures lacking pLesqYes or in cultures containing 

25 pLesqYes grown on glucose (repressing conditions) 
rather than galactose (inducing conditions) . Mass 
spectrometry of the peak (Figure 13) revealed that 
the peak has the same spectrum as 0-TMS methyl 
ricinoleate. Thus, on the basis of chromatographic 

30 retention time and mass spectrum, it was concluded 
that the peak corresponded to O-TMS methyl 
ricinoleate. The presence of ricinoleate in the 
transgenic yeast cultures confirms the identity of 
the gene as a kappa hydroxylase of this invention. 
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EXAMPLE 3 . OBTAINING OTHER PLANT FATTY ACYL 
HYDROXYLASES 

The castor fahl2 sequence could be used to 
identify other kappa hydroxylases by methods such as 
5 PCR and heterologous hybridization. However, because 
of the high degree of sequence similarity between 
A12 desaturases and kappa hydroxylases, the prior 
art does not teach how to distinguish between the 
two kinds of enzymes without a functional test such 

10 as demonstrating activity in transgenic plants or 

another suitable host (e.g., transgenic microbial or 
animal cells) . The identification of the L . fendleri 
hydroxylase provided for the development of criteria 
by which a hydroxylase and a desaturase may be 

15 distinguished solely on the basis of deduced amino 
acid sequence information. 

Figures 9A-B show a sequence alignment of the 
castor and L. fendleri hydroxylase sequences with 
the castor hydroxylase sequence and all publicly 

20 available sequences for all plant microsomal A12 
fatty acid desaturases. Of the 384 amino acid 
residues in the castor hydroxylase sequence, more 
than 95% are identical to the corresponding residue 
in at least one of the desaturase sequences. 

2 5 Therefore, none of these residues are responsible 

for the catalytic differences between the 
hydroxylase and the desaturases. Of the remaining 16 
residues in the castor hydroxylase and 14 residues 
in the Lesquerella. hydroxylase, all but seven 

3 0 represent instances where the hydroxylase sequence 

has a conservative substitution compared with one or 
more of the desaturase sequences, or there is wide 
variability in the amino acid at that position in 
the various desaturases. By conservative, it is 
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meant that the following amino acids are 
functionally equivalent: Ser/Thr, Ile/Leu/Val/Met , 
Asp/Glu. Thus, these structural differences also 
cannot account for the catalytic differences between 
5 the desaturases and hydroxylases. This leaves just 
seven amino acid residues where both the castor 
hydroxylase and the Lesguerella. hydroxylase differ 
from all of the known desaturases and where all of 
the known microsomal A12 desaturases have the 

10 identical amino acid residue. These residues occur 
at positions 69, 111, 155, 226, 304, 331 and 333 of 
the alignment in Figure 9. Therefore, these seven 
sites distinguish hydroxylases from desaturases. 
Based on this analysis, the present inventors 

15 believe that any enzyme with greater than 60% 

sequence identity to one of the enzymes listed in 
Figure 9 can be classified as a hydroxylase if it 
differs from the sequence of the desaturases at 
these seven positions. Because of slight differences 

20 in the number of residues in a particular protein, 
the numbering may vary from protein to protein but 
the intent of the number system will be evident if 
the protein in question is aligned with the castor 
hydroxylase using the numbering system shown herein. 

25 Thus, in conjunction with the methods for using the 
Lesquerella hydroxylase gene to isolate homologous 
genes, the structural criterion disclosed here 
teaches how to isolate and identify plant kappa 
hydroxylase genes for the purpose of genetically 

30 modifying fatty acid composition as disclosed 
herein . 
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EXAMPLE 4 - USING HYDROXYLASES TO ALTER THE LEVEL OF 
FATTY ACID UNS ATURAT I ON 

Evidence that kappa hydroxylases of this 
invention can be used to alter the level of fatty 
5 acid unsaturation was obtained from the analysis of 
transgenic plants that expressed the castor 
hydroxylase under control of the cauliflower mosaic 
virus promoter. The construction of the plasmids and 
the production of transgenic Arabidopsis plants was 
10 described in Example 1 (above) . The fatty acid 

composition of seed lipids from wild type and six 
transgenic lines (1-2/a, 1-2/b, 1-3/b, 4F, 7E, 7F) 
is shown in Table 2 . 

Table 2. Fatty acid composition of lipids from 
15 Arabidopsis seeds. The asterisk (*) indicates that 
for some of these samples, the 18:3 and 20:1 peaks 
overlapped on the gas chromatograph and, therefore, 
the total amount of these two fatty acids is 
reported . 
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The results in Table 2 show that expression 
of the castor hydroxylase in transgenic Arahidopsis 
plants caused a substantial increase in the amount 
of oleic acid (18:1) in the seed lipids and an 
5 approximately corresponding decrease in the amount 

of linoleic acid (18:2). The average amount of oleic 
acid in the six transgenic lines was 29.9% versus 
14.7% in the wild type. 

The precise mechanism by which expression of 

10 the castor hydroxylase gene causes increased 

accumulation of oleic acid is not known. However, an 
understanding of the mechanism is not required in 
order to exploit this invention for the directed 
alteration of plant lipid fatty acid composition. 

15 Furthermore, it will be recognized by one skilled in 
the art that many improvements of this invention may 
be envisioned. Of particular interest will be the 
use of other promoters which have high levels of 
seed-specific expression . 

20 Since hydroxylated fatty acids were not 

detected in the seed lipids of transgenic line l-2b, 
it seems likely that it is not the presence of 
hydroxylated fatty acids per se that causes the 
effect of the castor hydroxylase gene on desaturase 

25 activity. Protein-protein interaction between the 

hydroxylase and the A12-oleate desaturase or another 
protein may be required for the overall reaction 
(e.g., cytochrome b5) or for the regulation of 
desaturase activity. For example, interaction 

3 0 between the hydroxylase and this other protein may 
suppress the activity of the desaturase. In 
particular, the quaternary structure of the 
membrane -bound desaturases has not been established. 
It is possible that these enzymes are active as 
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dimers or as multimeric complexes containing more 
than two subunits . Thus, if dimers or multimers form 
between the desaturase and the hydroxylase, the 
presence of the hydroxylase in the complex may 
5 disrupt the activity of the desaturase. 

Transgenic plants may be produced in which 
the hydroxylase enzyme has been rendered inactive by 
the elimination of one or more of the histidine 
residues that have been proposed to bind iron 

10 molecules required for catalysis. Several of these 
histidine residues have been shown to be essential 
for desaturase activity by site directed mutagenesis 
(Shanklin et al . , 1994) . Codons encoding histidine 
residues in the castor hydroxylase gene will be 

15 changed to alanine residues as described by Shanklin 
et al . (1994) . The modified genes will be introduced 
into transgenic plants of Arabidopsis , and possibly 
other species such as tobacco, by the methods 
described in Example 1 of this application. 

2 0 In order to examine the effect on all 

tissues, the strong constitutive cauliflower mosaic 
virus promoter may be used to cause transcription of 
the modified genes. However, it will be recognized 
that in order to specifically examine the effect of 

2 5 expression of the mutant gene on seed lipids, a 

seed-specific promoter such as the B . napus napin 
promoter may be used. An expected outcome is that 
expression of the inactive hydroxylase protein in 
transgenic plants will inhibit the activity of the 

30 endoplasmic reticulum-localized A12 -desaturase . 

Maximum inhibition will be obtained by expressing 
high levels of the mutant protein. 

In a further embodiment of this invention, 
mutations that inactivate other hydroxylases, such 
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as the Lesquerella hydroxylase of this invention, 
may also be useful for decreasing the amount of 
endoplasmic reticulum-localized A12 -desaturase 
activity in the same way as the castor gene. In a 
5 further embodiment of this invention, similar 

mutations of desaturase genes may also be used to 
inactivate endogenous desaturases. Thus, expression 
of catalytically inactive fad2 gene from Arabidopsis 
in transgenic Arabidopsis may inhibit the activity 
10 of the endogenous fad2 gene product. 

Similarly, expression of the catalytically 
inactive forms of A12 -desaturase from Arabidopsis or 
other plants in transgenic soybean, rapeseed, 
Crambe, Brassica juncea, canola, flax, sunflower, 
15 saf flower, cotton, cuphea, soybean, peanut, coconut, 
oil palm or corn may lead to inactivation of 
endogenous A12 -desaturase activity in these plants. 
In a further embodiment of this invention, 
expression of catalytically inactive forms of other 
20 desaturases such as the A15 -desaturases may lead to 
inactivation of the corresponding desaturases. 

An example of a class of mutants useful in 
the present invention are "dominant negative" 
mutants that block the function of a gene at the 
25 protein level (Herskowitz, 1987) . A cloned gene is 
altered so that it encodes a mutant product capable 
of inhibiting the wild type gene product in a cell, 
thus causing the cell to be deficient in the 
function of that gene product. Inhibitory variants 
30 of a wild type product can be designed because 

proteins have multiple functional domains that can 
be mutated independently, e.g., oligomerizat ion, 
substrate binding, catalysis, membrane association 
domains or the like. In general, dominant negative 
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proteins retain an intact, functional subset of the 
domains of the parent, wild type protein, but have 
the complement of that subset either missing or 
altered so as to be nonfunctional. 
5 Whatever the precise basis for the inhibitory 

effect of the castor hydroxylase on desaturat ion , 
because the castor hydroxylase has very low 
nucleotide sequence homology (i.e., about 67%) to 
the Arabidopsis fad2 gene (encoding the endoplasmic 

10 ret iculum-localized A12 -desaturase ) , the inhibitory 
effect of this gene, which is provisionally called 
"protein-mediated inhibition" ( "protibit ion" ) , may 
have broad utility because it does not depend on a 
high degree of nucleotide sequence homology between 

15 the transgene and the endogenous target gene. In 
particular, the castor hydroxylase may be used to 
inhibit the endoplasmic reticulum- localized A12- 
desaturase activity of all higher plants. Of 
particular relevance are those species used for oil 

20 production. These include but are not limited to 
rapeseed, Crambe , Brassica juncea, canola, flax, 
sunflower, safflower, cotton, cuphea, soybean, 
peanut, coconut, oil palm and corn. 

CONCLUDING REMARKS 

25 By the above examples, demonstration of 

critical factors in the production of novel 
hydroxylated fatty acids by expression of a kappa 
hydroxylase gene from castor in transgenic plants is 
described. In addition, a complete cDNA sequence of 

3 0 the Lesquerella fendleri kappa hydroxylase is also 
provided. A full sequence of the castor hydroxylase 
is also given with various constructs for use in 
host cells. Through this invention, one can obtain 
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the amino acid and nucleic acid sequences which 
encode plant fatty acyl hydroxylases from a variety 
of sources and for a variety of applications. Also 
revealed is a novel method by which the level of 
5 fatty acid desaturation can be altered in a directed 
way through the use of genetically altered 
hydroxylase or desaturase genes . 

All pub 1 i cat ions mentioned in this 
specification are indicative of the level of skill 

10 of those skilled in the art to which this invention 
pertains. All publications are herein incorporated 
by reference to the same extent as if each 
individual publication was specifically and 
individually indicated to be incorporated by 

15 reference. 

Although the foregoing invention has been 
described in some detail by way of illustration and 
example for purposes of clarity of understanding, it 
will be obvious that certain changes and 

20 modifications may be practiced within the scope of 
the appended claims . 
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SEQUENCE LISTING 
{1} GENERAL INFORMATION: 

(i) APPLICANT: Somerville, Chris 

Broun, Pierre 
van de Loo, Frank 
Boddupalli, Sekhar S. 

(ii) TITLE OF INVENTION: Production of Hydroxy lated 
Fatty Acids in Genetically Modified Plants 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PILLSBURY MADISON Sc SUTRO 

(B) STREET: 1100 NEW YORK AVENUE, N.W. 

(C) CITY: WASHINGTON 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20005-3918 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 MB storage 

(B) COMPUTER: IBM compatible 

(C) OPERATING SYSTEM: DOS 5.0 

(D) SOFTWARE: Word Perfect 5.1 

(vi) CURRENT APPLICATION DATA; 

(A) APPLICATION NUMBER: not yet assigned 

(B) FILING DATE: February 6, 1997 

( C ) CLAS S I F I CAT I ON : 



(2) INFORMATION FOR SEQ ID NO:l 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 nucleotides 

(B) TYPE: nucleotide 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TATTGGCACC GGCGGCACCA TTCCAACAAT GGATCCCTAG 4 0 

AAAAAGATGA AGTCTTTGTC CCACCTAAGA AAGCTGCAGT 80 

CAN AT GGTAT GTCAAATACC TCAACAACCC TCTTGGACGC 12 0 



ATTCTGGTGT TAACAGTTCA GTTTATCCTC GGGTGGCCTT 16 0 
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/"I fTl rri r t t T\ J\ i i i r'" i » r i *T\ 

Lil lAAib 1 A 


rCAGGTAGAC 


CTTATGATGG 


200 


111 ULjL. i 1 L.A 


LAI I ILITCC 


CT C ATG C AC C 


TATCTTTAAG 


240 




Li 1 V- 1 L-V-AtjAl 


AiALATCTCA 


GATGCTGGTA 


280 


1 I L I ALtL Ikj 1 


L- ILjI 1 A1Cj(j-1 


CTTTACCGTT 


ACGCTGCTTC 


320 


ACAAGGATTG 


ACTGCTATGA 


TCTGCGTCTA 


CGGAGTACCG 


360 


CTTTTGATAG 


TGAACTTTTT 


CCTTGTCTTG 


GTCACTTTCT 


400 


TGCAGCACAC 


TCATCCTTCA 


TTACCTCACT 


ATG ATT C AAC 


440 


CGAGTGGGAA 


TGGATTAGAG 


GAG CTTTGGT 


TACGGTAGAC 


480 


AGAGACTATG 


GAATCTTGAA 


CAAGGTGTTT 


CACAACATAA 


520 


CAGACACCCA 


CGTAGCACAC 


CAC 




543 



(2) INFORMATION FOR SEQ ID NO : 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 nucleotides 

(B) TYPE : nucleotide 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

TATAGG CACC GGAGG CAC C A TTCCAACACA GGATCCCTCG 4 0 

AAAGAGATGA AGTATTTGTC C C AAAGC AG A AATCCGCAAT 8 0 

CAAGTGGTAC GGCGAATACC TCAACAACCC TCCTGGTCGC 12 0 

ATCATGATGT TAACTGTCCA GTTCGTCCTC GGATGGCCCT 16 0 

TGTACTTAGC CTTCAACGTT TCTGGCAGAC CCTACAATGG 20 0 

TTTCGCTTCC CATTTCTTCC CCAATGCTCC TATCTACAAC 24 0 

G AC CGTGAAC GCCTCCAGAT TTACATCTCT GATGCTGGTA 28 0 

TTCTAGCCGT CTGTTATGGT CTTTACCGTT ACGCTGTTGC 32 0 

ACAAGGACTA GCCTCAATGA T CT GT CT AAA CGGAGTTCCG 3 60 

CTTCTGATAG TTAACTTTTT CCTCGTCTTG ATCACTTACT 400 
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TACAACACAC TCACCCTGCG TTGCCTCACT ATGATTCATC 44 0 

AGAGTGGGAT TGGCTTAGAG GAGCTTTAGC TACTGTAGAC 4 80 

AGAGACTATG GAATCTTGAA CAAGGTGTTC CATAACATCA 52 0 

CAGACACCCA CGTCGCACAC CACT 544 

(2) INFORMATION FOR SEQ ID NO : 3 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1855 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



ATGAAGCTTT 


ATAAGAAGTT 


AGTTTTCTCT 


GGTGACAGAG 


40 


AAATTNTGTC 


AATTGGTAGT 


GACAGTTGAA 


GCAACAGGAA 


80 


CAACAAGGAT 


GGTTGGTGNT 


GATGCTGATG 


TGGTGATGTG 


120 


TT ATT CATC A 


AATACTAAAT 


ACTACATTAC 


TTGTTGCTGC 


160 


CTACTTCTCC 


TATTTCCTCC 


GCCACCCATT 


TTGGACCCAC 


200 


GANCCTTCCA 


TTTAAACCCT 


CTCTCGTGCT 


ATTCACCAGA 


240 


AGAGAAGCCA 


AGAGAGAGAG 


AGAGAGAATG 


TT CTGAGGAT 


280 


CATTGTCTTC 


TTCATCGTTA 


TTAACGTAAG 


TTTTTTTTGA 


320 


CCACTCATAT 


CTAAAATCTA 


GTACATGCAA 


TAGATTAATG 


360 


ACTGTTCCTT 


CTTTTGATAT 


TTTCAGCTTC 


TTGAATTCAA 


400 


GATGGGTGCT 


GGTGGAAGAA 


TAATGGTTAC 


CCCCTCTTCC 


440 


AAGAAATCAG 


AAACTGAAGC 


CCTAAAACGT 


GGACCATGTG 


480 


AGAAACCACC 


ATTCACTGTT 


AAAGATCTGA 


AGAAAGCAAT 


520 


CCCACAGCAT 


TGTTTCAAGC 


GCTCTATCCC 


TCGTTCTTTC 


560 


TCCTACCTTC 


TCACAGATAT 


CACTTTAGTT 


TCTTGCTTCT 


600 


ACT ACGTTG C 


CACAAATTAC 


TTCTCTCTTC 


TTCCTCAGCC 


640 
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TCTCTCTACT 


TACCTAGPTT 


PPPPTPTPTA 

www w X w X w X r\ 


x x LjLjVj X A 1 w 1 


680 


CAAGGCTGTG 


TCTTAAPCGG 

x w x x rxriv-'wvjw 


X r\ X w X w*ww x w 




72 C 


AATGTGGTCA 


CPATGPATTP 

w < — X w w./i. X J. w 


.riw X wA w 1A1L 


AATGGGTAGA 


760 


TGAPAPTPTT 
x x w x x 


P P T T TT A T P T 

wVj X X X X j\ 1 L i 


1 w w A lilll 1 


CCTTCTCGTC 


800 


v_. ■> — . X i. -ii. — X X n — X 


P PTPP A A A T A 


w-rtw 1 L.A X ww 1 


w bi CACCATT 


84 0 


C C AAC AATGG 




A A ap a tp a ap 

fifirtuA 1 wAAw* 


TP riff 1 nn /— t rp / — 1 


880 


APPGAAGAAA 


ww X ULriU X w,r\ 


a atppt a tpt 


I AAATAC CT C 


920 




TTPP A P cx P a t 

X X wwrt.w wwi-1 X 


TPTPPTP TT * 
± w 1 ww lbl 1 A 


ACAGTTCAGT 


960 


TTATCCTCGG 

X X xi X n— "w X W VJ VJ 


w X WW w w X X X w 


TATPTZiPPPT 


■'f"P7A A TPT ATP 


-1 r\ r~\ r\ 

10 00 


AGGTAGACCT 

**NJVJ x -r^w^^i. w w x 


TATPATPPTT 

x xt. x un x ww x x 


TPPPTTP APA 

X www X X v,riLH 


A 1 1 w X 1 www 1 


1U4U 


C ATG C AC CT A 


TCTTTAAAGA 

X W- XXX iiTiliVJ/T. 


PPGAGAAPPP 

w w w rt. UnA w w w 


PTPPAPRTZiT 
w X w Lnbn iHl 


n r\ 0 r\ 


APATPTCAGA 


TGPTGGTATT 

XwwX wwX/AX X 


PTAGPTPTPT 
w X r\\jj w 1 vj X w X 


bl 1H1 KjKj 1 w X 


llz U 


TTACCGTTAC 


GPTGPTTrAP 

ww X w w X X V /~i.W 


A A pp a ttp a p 


TPPTSTPATP 


1 1 /"A 

lloU 


TGCGTCTATG 

X v-J V- >»J X W X X VJ 


GAGTAPPGPT 

WX^W X ^"X w v— w w X 


XXX wx\ X rlu X w 


A A P'T' I'T'I'TPP 
nnb 11111 ww 


-t o n n 


TTGTCTTGGT 

X X \J X W> X X WW X 


AAGTTTGTTG 

rvcvu xx x w x x w 


PAGPAPAPTP 


ATPP'V'I'PP'I"! 1 
-rt. X ww 1 X bb 1 1 


1 n yi a 


ACCTCATTAT 

W W X ^ — -XX X X *i X 


GATTGAAGPG 

W X X w /"XT! \ — \ — i 


x^w X w w wr-i/A X w 


CI A T"T A P A rp A 
wrA x 1 


1 *") q n 


GCTTTGGTTA 

»— xxx ww x x ri 


rGGTAGATAG 

wvjw X / vvJJ/T. \ ^Ivj 


APA PTATPPZi 


ATA TTP A A PA 
A 1 A 1 1 bAALA 




AGGTGTT CP A 

«WW X VJ X X v_- Cfi 


TAAPATA APA 


PAPA PAP ATP 


TPPPTP ATP A 
1 Lj ww x LH 1 wA 


i "3 c n 
libU 


x w x w xxx vj^n 


APT AT APPPP 

.TTLW X /"i X X"\.W WWW 


ATT AT A A PPP 


A A TPP A A PPT 
Art. x buHAbb 1 


t / n n 
14 U U 


ACAGAGG CG A 


TAAAGPPAAT 


/Aw X X ww X wri X 


T A PT7A PP A PT 
1 /Aw 1 Aw w A w 1 


~\ a a r\ 


TCGATGGAAC 


AC CG TG G TAT 


GTGGCCATGT 


ATAGGGAAGC 


1480 


AAAGGAGTGT 


CTCTATGTAG 


AACCGGATAC 


GGAACGTGGG 


1520 


AAGAAAGGTG 


TCTACTATTA 


CAACAATAAG 


TTATGAGGCT 


1560 


GATAGGGCGA 


GAGAAGT G C A 


ATTATCAATC 


TTCATTTCCA 


1600 


TGTTTTAGGT 


GTCTTGTTTA 


AG AAG CTATG 


CTTTGTTT C A 


1640 


ATAATCT CAG 


AGTCCATNTA 


GTTGTGTTCT 


GGTGCATTTT 


1680 
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GCCTAGTTAT GTGGTGTCGG AAGTTAGTGT TCAAACTGCT 172 0 

TCCTGCTGTG CTGCCCAGTG AAGAACAAGT TTACGTGTTT 176 0 

AAAAT ACT CG GAACGAATTG ACCACAANAT ATCCAAAACC 18 00 

GGCTATCCGA ATTCCATATC CGAAAACCGG ATATCCAAAT 184 0 

T T C C AG AG T A CTTAG 185 5 

(2) INFORMATION FOR SEQ ID NO : 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 84 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Gly Ala Gly Gly Arg lie Met Val Thr 

5 10 

Pro Ser Ser Lys Lys Ser Glu Thr Glu Ala 

15 20 

Leu Lys Arg Gly Pro Cys Glu Lys Pro Pro 

25 30 

Phe Thr Val Lys Asp Leu Lys Lys Ala lie 

35 40 

Pro Gin His Cys Phe Lys Arg Ser lie Pro 

45 50 

Arg Ser Phe Ser Tyr Leu Leu Thr Asp lie 

55 60 

Thr Leu Val Ser Cys Phe Tyr Tyr Val Ala 

65 70 

Thr Asn Tyr Phe Ser Leu Leu Pro Gin Pro 

75 80 

Leu Ser Thr Tyr Leu Ala Trp Pro Leu Tyr 

85 90 

Trp Val Cys Gin Gly Cys Val Leu Thr Gly 

95 100 
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lie Trp Val lie Gly His Glu Cys Gly His 

105 no 

His Ala Phe Ser Asp Tyr Gin Trp Val Asp 

115 ~ 120 

Asp Thr Val Gly Phe lie Phe His Ser Phe 

125 130 

Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr 

13 5 ~ " 14 0 

Ser His Arg Arg His His Ser Asn Asn Gly 

145 150 

Ser Leu Glu Lys Asp Glu Val Phe Val Pro 

155 160 

Pro Lys Lys Ala Ala Val Lys Trp Tyr Val 

165 170 

Lys Tyr Leu Asn Asn Pro Leu Gly Arg lie 

175 180 

Leu Val Leu Thr Val Gin Phe lie Leu Gly 

185 190 

Trp Pro Leu Tyr Leu Ala Phe Asn Val Ser 

195 200 

Gly Arg Pro Tyr Asp Gly Phe Ala Ser His 

205 210 

Phe Phe Pro His Ala Pro lie Phe Lys Asp 

215 220 

Arg Glu Arg Leu Gin lie Tyr lie Ser Asp 

225 * 230 

Ala Gly lie Leu Ala Val Cys Tyr Gly Leu 

235 240 

Tyr Arg Tyr Ala Ala Ser Gin Gly Leu Thr 

245 250 

Ala Met lie Cys Val Tyr Gly Val Pro Leu 

255 260 

Leu lie Val Asn Phe Phe Leu Val Leu Val 

265 270 
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Thr Phe Leu Gin His Thr His Pro Ser Leu 

275 280 

Pro His Tyr Asp Ser Thr Glu Trp Glu Trp 

285 290 

lie Arg Gly Ala Leu Val Thr Val Asp Arg 

295 300 

Asp Tyr Gly lie Leu Asn Lys Val Phe His 

305 310 

Asn lie Thr Asp Thr His Val Ala His His 

315 320 

Leu Phe Ala Thr lie Pro His Tyr Asn Ala 

325 330 

Met Glu Ala Thr Glu Ala lie Lys Pro lie 

335 340 

Leu Gly Asp Tyr Tyr His Phe Asp Gly Thr 

345 350 

Pro Trp Tyr Val Ala Met Tyr Arg Glu Ala 

355 360 

Lys Glu Cys Leu Tyr Val Glu Pro Asp Thr 

365 370 

Glu Arg Gly Lys Lys Gly Val Tyr Tyr Tyr 

375 380 

Asn Asn Lys Leu 



(2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 87 amino acids 

(B) TYPE : amino acid 
{ C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Gly Gly Gly Gly Arg Met Ser Thr Val 

5 10 

lie Thr Ser Asn Asn Ser Glu Lys Lys Gly 

15 20 
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Gly Ser Ser His Leu Lys Arg Ala Pro His 

25 ' 30 

Thr Lys Pro Pro Phe Thr Leu Gly Asp Leu 

35 * 40 

Lys Arg Ala lie Pro Pro His Cys Phe Glu 

45 50 

Arg Ser Phe Val Arg Ser Phe Ser Tyr Val 

5 5 6 0 

Ala Tyr Asp Val Cys Leu Ser Phe Leu Phe 

65 70 

Tyr Ser lie Ala Thr Asn Phe Phe Pro Tyr 

75 80 

lie Ser Ser Pro Leu Ser Tyr Val Ala Trp 

85 90 

Leu Val Tyr Trp Leu Phe Gin Gly Cys lie 

95 100 

Leu Thr Gly Leu Trp Val lie Gly His Glu 

105 110 

Cys Gly His His Ala Phe Ser Glu Tyr Gin 

115 120 

Leu Ala Asp Asp lie Val Gly Leu lie Val 

125 130 

His Ser Ala Leu Leu Val Pro Tyr Phe Ser 

135 140 

Trp Lys Tyr Ser His Arg Arg His His Ser 

14 5 ~ 150 

Asn lie Gly Ser Leu Glu Arg Asp Glu Val 

155 160 

Phe Val Pro Lys Ser Lys Ser Lys lie Ser 

165 170 

Trp Tyr Ser Lys Tyr Ser Asn Asn Pro Pro 

175 180 



Gly Arg Val Leu Thr Leu Ala Ala Thr Leu 

185 190 
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Leu Leu Gly Trp Pro Leu Tyr Leu Ala Phe 

195 - 200 

Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe 

205 ~ ' 210 

Ala Cys His Tyr Asp Pro Tyr Gly Pro lie 

215 220 

Phe Ser Glu Arg Glu Arg Leu Gin lie Tyr 

225 230 

lie Ala Asp Leu Gly lie Phe Ala Thr Thr 

235 240 

Phe Val Leu Tyr Gin Ala Thr Met Ala Lys 

245 250 

Gly Leu Ala Trp Val Met Arg lie Tyr Gly 

255 260 

Val Pro Leu Leu lie Val Asn Cys Phe Leu 

265 270 

Val Met lie Thr Tyr Leu Gin His Thr His 

275 280 

Pro Ala lie Pro Arg Tyr Gly Ser Ser Glu 

285 290 

Trp Asp Trp Leu Arg Gly Ala Met Val Thr 

295 300 

Val Asp Arg Asp Tyr Gly Val Leu Asn Lys 

305 310 

Val Phe His Asn He Ala Asp Thr His Val 

315 320 

Ala His His Leu Phe Ala Thr Val Pro His 

325 330 

Tyr His Ala Met Glu Ala Thr Lys Ala He 

335 340 

Lys Pro He Met Gly Glu Tyr Tyr Arg Tyr 

345 350 

Asp Gly Thr Pro Phe Tyr Lys Ala Leu Trp 

355 ^ ' 360 
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Arg Glu Ala Lys Glu Cys Leu Phe Val Glu 

365 370 

Pro Asp Glu Gly Ala Pro Thr Gin Gly Val 

375 380 

Phe Trp Tyr Arg Asn Lys Tyr 

3 85 

(2) INFORMATION FOR SEQ ID NO : 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Gly Ala Gly Gly Arg Met Pro Val Pro 

5 10 

Thr Ser Ser Lys Lys Ser Glu Thr Asp Thr 

15 ^ 20 

Thr Lys Arg Val Pro Cys Glu Lys Pro Pro 

25 30 

Phe Ser Val Gly Asp Leu Lys Lys Ala lie 

3 5 ' 4 0 

Pro Pro His Cys Phe Lys Arg Ser lie Pro 

45 - 50 

Arg Ser Phe Ser Tyr Leu lie Ser Asp lie 

5 5 6 0 

lie lie Ala Ser Cys Phe Tyr Tyr Val Ala 

65 70 

Thr Asn Tyr Phe Ser Leu Leu Pro Gin Pro 

75 80 

Leu Ser Tyr Leu Ala Trp Pro Leu Tyr Trp 

85 90 

Ala Cys Gin Gly Cys Val Leu Thr Gly lie 

95 100 

Trp Val lie Ala His Glu Cys Gly His His 

105 110 
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Ala Phe Ser Asp Tyr Gin Trp Leu Asp Asp 

115 ' " 120 

Thr Val Gly Leu lie Phe His Ser Phe Leu 

125 130 

Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser 

135 * 140 

His Arg Arg His His Ser Asn Thr Gly Ser 

145 ^ 150 

Leu Glu Arg Asp Glu Val Phe Val Pro Lys 

155 160 

Gin Lys Ser Ala lie Lys Trp Tyr Gly Lys 

165 " ~ 170 

Tyr Leu Asn Asn Pro Leu Gly Arg lie Met 

175 ^ 180 

Met Leu Thr Val Gin Phe Val Leu Gly Trp 

185 190 

Pro Leu Tyr Leu Ala Phe Asn Val Ser Gly 

195 200 

Arg Pro Tyr Asp Gly Phe Ala Cys His Phe 

205 210 

Phe Pro Asn Ala Pro lie Tyr Asn Asp Arg 

215 220 

Glu Arg Leu Gin lie Tyr Leu Ser Asp Ala 

225 230 

Gly lie Leu Ala Val Cys Phe Gly Leu Tyr 

235 240 

Arg Tyr Ala Ala Ala Gin Gly Met Ala Ser 

245 250 

Met lie Cys Leu Tyr Gly Val Pro Leu Leu 

255 260 

lie Val Asn Ala Phe Leu Val Leu He Thr 

265 270 

Tyr Leu Gin His Thr His Pro Ser Leu Pro 

275 280 
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His Tyr Asp Ser Ser Glu Trp Asp Trp Leu 

285 ~ ~ 290 

Arg Gly Ala Leu Ala Thr Val Asp Arg Asp 

295 300 

Tyr Gly lie Leu Asn Lys Val Phe His Asn 

305 310 

lie Thr Asp Thr His Val Ala His His Leu 

315 320 

Phe Ser Thr Met Pro His Tyr Asn Ala Met 

325 330 

Glu Ala Thr Lys Ala lie Lys Pro lie Leu 

335 " 340 

Gly Asp Tyr Tyr Gin Phe Asp Gly Thr Pro 

345 350 

Trp Tyr Val Ala Met Tyr Arg Glu Ala Lys 

355 " 360 

Glu Cys lie Tyr Val Glu Pro Asp Arg Glu 

365 370 

Gly Asp Lys Lys Gly Val Tyr Trp Tyr Asn 

375 * ~ 380 

Asn Lys Leu 



(2) INFORMATION FOR SEQ ID NO : 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Gly Ala Gly Gly Arg Met Gin Val Ser 

5 10 

Pro Pro Ser Lys Lys Ser Glu Thr Asp Asn 

15 20 



lie Lys Arg Val Pro Cys Glu 

25 



Thr Pro Pro 

30 
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Phe Thr Val Gly Glu Leu Lys Lys Ala lie 

35 4 0 

Pro Pro His Cys Phe Lys Arg Ser lie Pro 

45 ~ ~ 50 

Arg Ser Phe Ser His Leu lie Trp Asp lie 

55 ^ 60 

lie lie Ala Ser Cys Phe Tyr Tyr Val Ala 

65 ~ 70 

Thr Thr Tyr Phe Pro Leu Leu Pro Asn Pro 

75 80 

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp 

85 " 90 

Ala Cys Gin Gly Cys Val Leu Thr Gly Val 

95 100 

Trp Val lie Ala His Glu Cys Gly His Ala 

105 110 

Ala Phe Ser Asp Tyr Gin Trp Leu Asp Asp 

115 ^ 120 

Thr Val Gly Leu lie Phe His Ser Phe Leu 

125 130 

Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser 

135 * ^ ~ 140 

His Arg Arg His His Ser Asn Thr Gly Ser 

14 5 " 15 0 

Leu Glu Arg Asp Glu Val Phe Val Pro Arg 

155 160 

Arg Ser Gin Thr Ser Ser Gly Thr Ala Ser 

165 170 

Thr Ser Thr Thr Phe Gly Arg Thr Val Met 

175 180 

Leu Thr Val Gin Phe Thr Leu Gly Trp Pro 

185 190 

Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg 

195 " 200 
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Pro Tyr Asp Gly Gly Phe Ala Cys His Phe 

205 210 

His Pro Asn Ala Pro lie Tyr Asn Asp Arg 

215 * " 220 

Glu Arg Leu Gin lie Tyr lie Ser Asp Ala 

225 230 

Gly lie Leu Ala Val Cys Tyr Gly Leu Leu 

235 240 

Pro Tyr Ala Ala Val Gin Gly Val Ala Ser 

245 250 

Met Val Cys Phe Leu Arg Val Pro Leu Leu 

255 260 

lie Val Asn Gly Phe Leu Val Leu lie Thr 

265 270 

Tyr Leu Gin His Thr His Pro Ser Leu Pro 

275 280 

His Tyr Asp Ser Ser Glu Trp Asp Trp Leu 

285 290 

Arg Gly Ala Leu Ala Thr Val Asp Arg Asp 

295 300 

Tyr Gly lie Leu Asn Gin Gly Phe His Asn 

305 310 

lie Thr Asp Thr His Glu Ala His His Leu 

315 320 

Phe Ser Thr Met Pro His Tyr His Ala Met 

325 330 

Glu Ala Thr Lys Ala lie Lys Pro lie Leu 

335 340 

Gly Glu Tyr Tyr Gin Phe Asp Gly Thr Pro 

34 5 3 50 

Val Val Lys Ala Met Trp Arg Glu Ala Lys 

355 360 

Glu Cys lie Tyr Val Glu Pro Asp Arg Gin 

365 370 
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Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn 

375 380 

Asn Lys Leu Xaa 



(2) INFORMATION FOR SEQ ID NO : 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 09 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ser Leu Leu Thr Ser Phe Ser Tyr Val Val 

5 "* 10 

Tyr Asp Leu Ser Phe Ala Phe lie Phe Tyr 

15 20 

lie Ala Thr Thr Tyr Phe His Leu Leu Pro 

25 30 

Gin Pro Phe Ser Leu lie Ala Trp Pro lie 

35 40 

Tyr Trp Val Leu Gin Gly Cys Leu Leu Thr 

45 50 

Arg Val Cys Gly His His Ala Phe Ser Lys 

55 60 

Tyr Gin Trp Val Asp Asp Val Val Gly Leu 

6 5 * 70 

Thr Leu His Ser Thr Leu Leu Val Pro Tyr 

75 80 

Phe Ser Trp Lys lie Ser His Arg Arg His 

8 5 ~ 90 

His Ser Asn Thr Gly Ser Leu Asp Arg Asp 

95 ~ 100 

Glu Arg Val Lys Val Ala Trp Phe Ser Lys 

105 110 

Tvr Leu Asn Asn Pro Leu Gly Arg Ala Val 

115 120 
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Ser Leu Leu Val Thr Leu Thr lie Gly Trp 

125 130 

Pro Met Tyr Leu Ala Phe Asn Val Ser Gly 

135 140 

Arg Pro Tyr Asp Ser Phe Ala Ser His Tyr 

145 150 

His Pro Tyr Arg Val Arg Leu Leu lie Tyr 

155 160 

Val Ser Asp Val Ala Leu Phe Ser Val Thr 

165 170 

Tyr Ser Leu Tyr Arg Val Ala Thr Leu Lys 

175 180 

Gly Leu Val Trp Leu Leu Cys Val Tyr Gly 

185 190 

Val Pro Leu Leu lie Val Asn Gly Phe Leu 

195 " 200 

Val Thr lie Thr Tyr Leu Arg Val His Tyr 

205 ~ 210 

Asp Ser Ser Glu Trp Asp Trp Leu Lys Gly 

215 220 

Ala Leu Ala Thr Met Asp Arg Asp Tyr Gly 

225 ~ 230 

lie Leu Asn Lys Val Phe His His lie Thr 

235 240 

Asp Thr His Val Ala His His Leu Phe Ser 

245 250 

Thr Met Pro His Tyr His Leu Arg Val Lys 

255 260 

Pro lie Leu Gly Glu Tyr Tyr Gin Phe Asp 

265 270 

Asp Thr Pro Phe Tyr Lys Ala Leu Trp Arg 

275 280 

Glu Ala Arg Glu Cys Leu Tyr Val Glu Pro 

285 290 
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Asp Glu Gly Thr Ser Glu Lys Gly Val Tyr 

295 300 

Trp Tyr Arg Asn Lys Tyr Leu Arg Val 

305 

(2) INFORMATION FOR SEQ ID NO : 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 02 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Phe Ser Tyr Val Val Tyr Asp Leu Thr lie 

5 * ~ 10 

Ala Phe Cys Leu Tyr Tyr Val Ala Thr His 

15 20 

Tyr Phe His Leu Leu Pro Gly Pro Leu Ser 

25 30 

Phe Arg Gly Met Ala lie Tyr Trp Ala Val 

35 40 

Gin Gly Cys lie Leu Thr Gly Val Trp Val 

4 5 50 

Val Ala Phe Ser Asp Tyr Gin Leu Leu Asp 

55 60 

Asp lie Val Gly Leu lie Leu His Ser Ala 

65 70 

Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr 

75 80 

Ser His Arg Arg His His Ser Asn Thr Gly 

85 90 

Ser Leu Glu Arg Asp Glu Val Phe Val Pro 

95 100 

Lvs Val Ser Lys Tyr Leu Asn Asn Pro Pro 
~ 105 HO 

Glv Arq Val Leu Thr Leu Ala Val Thr Leu 

115 120 
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Thr Leu Gly Trp Pro Leu Tyr Leu Ala Leu 

125 130 

Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe 

135 140 

Ala Cys His Tyr Asp Pro Tyr Gly Pro He 

145 150 

Tvr Ser Val He Ser Asp Ala Gly Val Leu 
Y 155 160 

Ala Val Val Tyr Gly Leu Phe Arg Leu Ala 

165 170 

Met Ala Lys Gly Leu Ala Trp Val Val Cys 

175 180 

Val Tvr Gly Val Pro Leu Leu Val Val Asn 

185 190 

Glv Phe Leu Val Leu He Thr Phe Leu Gin 
3 200 

His Thr His Val Ser Glu Trp Asp Trp Leu 

205 21° 

Arg Gly Ala Leu Ala Thr Val Asp Arg Asp 

215 220 

Tyr Gly He Leu Asn Lys Val Phe His Asn 

225 230 

He Thr Asp Thr His Val Ala His His Leu 

23 5 240 

Phe Ser Thr Met Pro His Tyr His Ala Met 

245 2 50 

Glu Ala Thr Val Glu Tyr Tyr Arg Phe Asp 

255 260 

Glu Thr Pro Phe Val Lys Ala Met Trp Arg 

265 270 

Glu Ala Arg Glu Cys He Tyr Val Glu Pro 

275 280 

Asp Gin Ser Thr Glu Ser Lys Gly Val Phe 

285 290 
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Trp Tyr Asn Asn Lys Leu Ala Met Glu Ala 

295 300 

Thr Val 



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Met Gly Ala Gly Gly Arg Met Thr Glu Lys 

5 10 

Glu Arg Glu Lys Gin Glu Gin Leu Ala Arg 

15 20 

Ala Thr Gly Gly Ala Ala Met Gin Arg Ser 

25 30 

Pro Val Glu Lys Pro Pro Phe Thr Leu Gly 

35 40 

Gin lie Lys Lys Ala lie Pro Pro His Cys 

45 50 

Phe Glu Arg Ser Val Leu Lys Ser Phe Ser 

55 60 

Tyr Val Val His Asp Leu Val He Ala Ala 

65 70 

Ala Leu Leu Tyr Phe Ala Leu Ala He He 

75 80 

Pro Ala Leu Pro Ser Pro Leu Arg Tyr Ala 

85 90 

Ala Trp Pro Leu Tyr Trp He Ala Gin Gly 

95 100 

Ala Phe Ser Asp Tyr Ser Leu Leu Asp Asp 

105 ' 110 

Val Val Gly Leu Val Leu His Ser Ser Leu 

115 120 
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Met Val Pro Tyr Phe Ser Trp Lys Tyr Ser 

125 " ~ * 130 

His Arg Arg His His Ser Asn Thr Gly Ser 

135 140 

Leu Glu Arg Asp Glu Val Phe Val Pro Lys 

145 150 

Lys Lys Glu Ala Leu Pro Trp Tyr Thr Pro 

155 " 160 

Tyr Val Tyr Asn Asn Pro Val Gly Arg Val 

16 5 "* ~ 170 

Val His lie Val Val Gin Leu Thr Leu Gly 

175 180 

Trp Pro Leu Tyr Leu Ala Thr Asn Ala Ser 

185 190 

Gly Arg Pro Tyr Pro Arg Phe Ala Cys His 

195 200 

Phe Asp Pro Tyr Gly Pro lie Tyr Asn Asp 

205 210 

Arg Glu Arg Ala Gin lie Phe Val Ser Asp 

215 220 

Ala Gly Val Val Ala Val Ala Phe Gly Leu 

225 230 

Tyr Lys Leu Ala Ala Ala Phe Gly Val Trp 

235 240 

Trp Val Val Arg Val Tyr Ala Val Pro Leu 

245 250 

Leu lie Val Asn Ala Trp Leu Val Leu lie 

255 260 

Thr Tyr Leu Gin His Thr His Pro Ser Leu 

265 270 

Pro His Tyr Asp Ser Ser Glu Trp Asp Trp 

275 280 

Leu Arg Gly Ala Leu Ala Thr Met Asp Arg 

2 85 2 90 
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Asp Tyr Gly lie Leu Asn Arg Val Phe His 

295 ~ 300 

Asn lie Thr Asp Thr His Val Ala His His 

305 310 

Leu Phe Ser Thr Met Pro His Tyr His Ala 

315 320 

Met Glu Ala Thr Lys Ala lie Arg Pro lie 

325 330 

Leu Gly Asp Tyr Tyr His Phe Asp Pro Thr 

335 340 

Pro Val Ala Lys Ala Thr Trp Arg Glu Ala 

345 350 

Gly Glu Cys lie Tyr Val Glu Pro Glu Asp 

355 360 

Arg Lys Gly Val Phe Trp Tyr Asn Lys Lys 

3 65 ' ^3 70 

Phe Xaa 



(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11. 

Trp Val Met Ala His Asp Cys Gly His His 

5 ^ iq 

Ala Phe Ser Asp Tyr Gin Leu Leu Asp Asp 

15 20 

Val Val Gly Leu lie Leu His Ser Cys Leu 

25 30 

Leu Val Pro Tyr Phe Ser Trp Lys His Ser 

35 40 

His Arg Arg His His Ser Asn Thr Glv Ser 

45 50 
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Leu Glu Arg Asp Glu Val Phe Val Pro Lys 

55 60 

Lys Lys Ser Ser lie Arg Trp Tyr Ser Lys 

65 ~ ~ 70 

Tyr Leu Asn Asn Pro Pro Gly Arg lie Met 

75 80 

Thr lie Ala Val Thr Leu Ser Leu Gly Trp 

85 90 

Pro Leu Tyr Leu Ala Phe Asn Val Ser Gly 

95 100 

Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr 

105 110 

Asp Pro Tyr Gly Pro lie Tyr Asn Asp Arg 

115 ' 120 

Glu Arg lie Glu lie Phe lie Ser Asp Ala 

125 130 

Gly Val Leu Ala Val Thr Phe Gly Leu Tyr 

135 140 

Gin Leu Ala lie Ala Lys Gly Leu Ala Trp 

145 150 

Val Val Cys Val Tyr Gly Val Pro Leu Leu 

155 160 

Val Val Asn Ser Phe Leu Val Leu lie Thr 

165 170 

Phe Leu Gin His Thr His Pro Ala Leu Pro 

175 180 

His Tyr Asp Ser Ser Glu Trp Asp Trp Leu 

185 190 

Arg Gly Ala Leu Ala Thr Val Asp Arg Asp 

195 200 

Tyr Gly lie Leu Asn Lys Val Phe His Asn 

205 210 

lie Thr Asp Thr Gin Val Ala His His Leu 

215 220 
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Phe Thr Met Pro 



(2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 0 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GCTCTTTTGT GCGCTCATTC 2 0 



(2) INFORMATION FOR SEQ ID NO: 13 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
CGGT AC CAG A AAACGCCTTG 2 0 

(2) INFORMATION FOR SEQ ID NO : 14 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
TAYWSNCAYM GNMGNCAYCA 2 0 

(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



RTGRTGNGCN ACRTGNGTRT C 



21 
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WHAT IS CLAIMED IS : 

1 . A method of altering an amount of an 
unsaturated fatty acid in a seed of a plant 
comprising: decreasing a fatty acid desaturase 
activity in the seed by genetic manipulation of at 
least one of fatty acid desaturase or fatty acid 
hydroxylase . 

2. The method of Claim 1, wherein an 
endogenous gene for said fatty acid hydroxylase is 
mutated and thereby decreases fatty acid hydroxylase 
activity in the seed. 

3. The method of Claim 1, wherein said plant 
is transformed with a nucleic acid containing a 
sequence which encodes a fatty acid hydroxylase or 
derivative thereof . 

4. The method of Claim 3, wherein said 
derivative is a dominant negative mutant which 
thereby alters the amount of the unsaturated fatty 
acid in the seed. 

5. The method of Claim 3, wherein said 
derivative is a mutant fatty acid hydroxylase in 
which one or more essential histidine residues have 
been mutated which thereby alters the amount of the 
unsaturated fatty acid in the seed. 

6. The method of Claim 1, wherein an 
endogenous gene for said fatty acid desaturase is 
mutated and thereby decreases fatty acid desaturase 
activity in the seed. 
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7. The method of Claim 1, wherein said plant 
is transformed with a nucleic acid containing a 
sequence which encodes a fatty acid desaturase or 
derivative thereof . 

8. The method of Claim 7, wherein said 
derivative is a dominant negative mutant which 
thereby alters the amount of the unsaturated fatty 
acid in the seed. 

9. The method of Claim 7, wherein said 
derivative is a mutant fatty acid desaturase in 
which one or more essential histidine residues have 
been mutated which thereby alters the amount of the 
unsaturated fatty acid in the seed. 

10. The method of Claim 1, wherein said plant 
is selected from the group consisting of rapeseed, 
Craxribe, Brassica juncea, canola, flax, sunflower, 
saf flower, cotton, cuphea, soybean, peanut, coconut, 
oil palm and corn . 

11. A method of altering an amount of a 
unsaturated fatty acid comprising: 

(a) transforming a plant cell with a nucleic 
acid containing a sequence which encodes a fatty 
acid hydroxylase or a dominant negative mutant of 
fatty acid hydroxylase or a dominant negative mutant 
of fatty acid desaturase, 

(b) growing a seed-bearing plant from the 
transformed plant cell of step (a) , and 

(c) identifying a seed from the plant of step 
(b) with the altered amount of the unsaturated fatty 
acid in the seed. 
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12 . The method of Claim 11 , wherein said 
nucleic acid contains a sequence which encodes the 
dominant negative mutant of fatty acid hydroxylase 
in which one or more essential histidine residues 
have been mutated. 

13. The method of Claim 11, wherein said 
nucleic acid contains a sequence which encodes the 
dominant negative mutant of fatty acid hydroxylase 
which thereby alters the amount of the unsaturated 
fatty acid in the seed. 

14. The method of Claim 11, wherein said 
nucleic acid contains a sequence which encodes the 
dominant negative mutant of fatty acid desaturase in 
which one or more essential histidine residues have 
been mutated. 

15. The method of Claim 11, wherein said 
nucleic acid contains a sequence which encodes the 
dominant negative mutant of fatty acid desaturase 
which thereby alters the amount of the unsaturated 
fatty acid in the seed. 

16. The method of Claim 11, wherein said plant 
is selected from the group consisting of rapeseed, 
Crarnbe, Brass ica juncea, canola, flax, sunflower, 
saf flower, cotton, cuphea, soybean, peanut, coconut, 
oil palm and corn. 

17. A recombinant nucleic acid suitable for 
use in Claim 1, wherein said nucleic acid contains a 
sequence encoding a fatty acid hydroxylase with an 
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amino acid identity of 60% or greater to SEQ ID 
NO: 4 . 

18. The recombinant nucleic acid of Claim 17, 
wherein the amino acid identity is 90% or greater to 
SEQ ID NO: 4 . 

19. The recombinant nucleic acid of Claim 17, 
wherein the amino acid identity is 100% of SEQ ID 
NO: 4 . 

20. The recombinant nucleic acid of Claim 17, 
wherein said nucleic acid contains a sequence having 
a nucleotide identity of 90% or greater to SEQ ID 

NO : 1 , 2 or 3 . 

21. The recombinant nucleic acid of Claim 17, 
wherein said nucleic acid contains SEQ ID N0:1, 2 or 
3 . 

22. The recombinant nucleic acid of Claim 17, 
wherein said sequence is obtainable from a plant 
species producing a hydroxylated fatty acid. 

23. A recombinant nucleic acid suitable for 
use in Claim 1, wherein said nucleic acid contains a 
sequence encoding at least one of fatty acid 
desaturase or fatty acid hydroxylase. 

24. The recombinant nucleic acid of Claim 23, 
wherein said sequence is obtainable from Ricinus 
communis (L.) (castor). 
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25. The recombinant nucleic acid of Claim 23, 
wherein said sequence is obtainable from Lesquerella 
fendleri . 

26. The recombinant nucleic acid of Claim 23, 
wherein said nucleic acid contains a sequence 
encoding at least one of fatty acid desaturase or 
fatty acid hydroxylase in which one or more 
essential histidine residues have been mutated. 

27. The method of Claim 1 further comprising: 
processing the seed containing the altered amount of 
the unsaturated fatty acid to obtain oil and/or seed 
meal . 

28. Oil obtained by the method of Claim 27. 

29. Seed meal obtained by the method of Claim 

27 . 

30. Plant obtained by the method of Claim 1. 

31. The method of Claim 11 further comprising: 
processing the seed containing the altered amount of 
the unsaturated fatty acid to obtain oil and/or seed 
meal . 

32. Oil obtained by the method of Claim 31. 

33. Seed meal obtained by the method of Claim 

31 . 

34. Plant obtained by the method of Claim 11. 
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Ion #1: Mass 187 jcH 3 -(CH 2 ) 5 -CH-0-Si-(CH 3 )^j + 
Ion #2: Mass 299 

O 

j(CH 3 ) 3 -Si-0-CH-CH 2 -CH=CH-(CH 2 ) 7 -C-0-CH3 

Ion #3: Mass 270 (characteristic rearrangement ion) 

[cH 2 -CH=CH-(CH 2 ) 7 -C-0-CH^ 

O 

Si-(CH 3 ) 3 

Ion #4: Mass 185 (desaturated analog of Ion #1) 

jcH 3 -(CH 2 ) 2 -CH=CH-CH 2 -CH-0-Si-(CH 3 )^j + 

Ion #5: Mass 298 (elongated analog of Ion #3) 

[cH 2 -CH=CH(CH 2 ) 9 -C-0-CH 3 ^| + 

"~ O *" 

I 

Si-(CH 3 ) 3 

Ion #6: Mass 327 (elongated analog of ion 

O 

[TcH 3 ) 3 -Si-0-CH-CH 2 -CH=CH-(CH 2 ) 9 -C-0-CH^ + 
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Figure 3 
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Figure 4A Mass spectrum of peak 10 from figure 3B 
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Mass spectrum of peak 11 from figure 3B 
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Mass spectrum of peak 12 from figure 3B 
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30 


40 


TTCCAACAAT 


GGATCCCTAG 


90 


100 


CANATGGTAT 


GTCAAATACC 


150 


160 


GTTTATCCTC 


GGGTGGCCTT 


210 


220 


TTTCGCTTCA 


CATTTCTTCC 


270 


280 


ATACATCTCA 


GATGCTGGTA 
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340 


ACAAGGATTG 


ACTGCTATGA 


390 
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GTCACTTTCT 


450 


460 
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TGGATTAGAG 


510 
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CAAGGTGTTT 


CACAACATAA 
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AAAAAGATGA 


AGTCTTTGTC 
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TATCTTTAAG 
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TGCAGCACAC 


TCATCCTTCA 
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CAGACACCCA 


CGTAGCACAC 
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10 


20 


TATAGGCACC 


GGAGGCACCA 


70 


80 


CCAAAGCAGA 


AA7CCGCAAT 


130 


140 


ATCATGATGT 


TAACTGTCCA 


190 


200 


TCTGGCAGAC 


CCTACAATGG 
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260 


GACCGTGAAC 


GCCTCCAGAT 
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320 


CTTTACCGTT 


ACGCTGTTGC 
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CTTCTGATAG 


TTAACTTTTT 


430 
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TTGCCTCACT 


ATGATTCATC 
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GAATCTTGAA 
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GGCGAATACC 


150 


160 


GTTCGTCCTC 


GGATGGCCCT 


210 


220 


TTTCGCTTCC 


CATTTCTTCC 
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TATCTACAAC 
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CTGTTATGGT 
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TCTGTCTAAA 
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AT 


GAA 


GCT 


TTA 


~AA 


GAA 


2TT 


AGT 


TTT 


CTC 


TGG 


TGA 


CAG 


AGA 


AAT 


TNT 


47 


GTC 


AAT 


TGG 


TAG 


TGA 


CAG 


"TG 


AAG 


CAA 


CAG 


GAA 


CAA. 


CAA 


GGA 


TGG 


TTG 


95 


GTG 


NTG 


ATG 


CTG 


ATG 


TGG 


"GA 


TGT 


GTT 


ATT 


CAT 


CAA 


ATA 


CTA 


AAT 


ACT 


143 


ACA 


TTA 


CTT 


GTT 


GCT 


GCC 


"AC 


TTC 


TCC 


TAT 


TTC 


CTC 


CGC 


CAC 


CCA 


TTT 


191 


TGG 


ACC 


CAC 


GAN 


CCT 


TCC 


ATT 


TAA 


ACC 


CTC 


TCT 


CGT 


GCT 


ATT 


CAC 


CAG 


239 


AAG 


AGA 


AGC 


CAA 


GAG 


AGA 


GAG 


AGA 


GAG 


AAT 


GTT 


CTG 


AGG 


ATC 


ATT 


GTC 


287 


TTC 


TTC 


ATC 


GTT 


ATT 


AAC 


GTA 


AGT 


TTT 


TTT 


TGA 


CCA 


CTC 


ATA 


TCT 


AAA 


335 


ATC 


TAG 


TAC 


ATG 


CAA 


TAG 


ATT 


AAT 


GAC 


TGT 


TCC 


TTC 


TTT 


TGA 


TAT 


TTT 


383 


CAG 


CTT 


CTT 


GAA 


TTC 


AAG 


Met 
A.TG 


Gly 
GGT 


Al a 
GCT 


Gly 
GGT 


Gly 
GGA 


Arg 
AGA 


He 
ATA 


Met 
ATG 


Val 
GTT 


Thr 
ACC 


10 
431 


Pro 
CCC 


Ser 
TCT 


Ser 
TCC 


Lys 
AAG 


Lys 

AAA 


Ser 
TCA 


Glu 
GAA 


Thr 
ACT 


Glu 
GAA 


Ala 
GCC 


Leu 
CTA 


Lys 

AAA 


Arg 
CGT 


Gly 
GGA 


Pro 
CCA 


Cys 
TGT 


26 
479 


Glu 
GAG 


Lys 
AAA 


Pro 
CCA 


Pro 
CCA 


Phe 
TTC 


Thr 
ACT 


Val 
GTT 


Lys 

AAA 


Asp 
GAT 


Leu 
CTG 


Lys 
AAG 


Lys 
AAA 


Ala 
GCA 


lie 
ATC 


Pro 
CCA 


Gin 
CAG 


42 

527 


His 
CAT 


Cys 
TGT 


Phe 
TTC 


Lys 

AAG 


Arg 

CGC 


Ser 
TCT 


lie 
ATC 


Pro 
CCT 


Arg 
CGT 


Ser 
TCT 


Phe 
TTC 


Ser 
TCC 


Tyr 
TAC 


Leu 
CTT 


Leu 
CTC 


Thr 
ACA 


58 
575 


Asp 
GAT 


He 
ATC 


Thr 
ACT 


Leu 
TTA 


Val 
GTT 


Ser 
TCT 


Cys 
TGC 


Phe 
TTC 


Tyr 
TAC 


Tyr 
TAC 


Val 
GTT 


Ala 
GCC 


Thr 
ACA 


Asn 
AAT 


Tyr 
TAC 


Phe 
TTC 


74 

623 


Ser 
TCT 


Leu 
CTT 


Leu 
CTT 


Pro 
CCT 


Gin 
CAG 


Pro 
CCT 


Leu 
CTC 


Ser 
TCT 


Thr 
ACT 


Tyr 
TAC 


Leu 
CTA 


Ala 
GCT 


Trp 
TGG 


Pro 
CCT 


Leu 
CTC 


Tyr 
TAT 


90 
671 


Trp Val 
TGG GTA 


Cys 
TGT 


Gin 
CAA 


Gly Cys 
GGC TGT 


Val 
GTC 


Leu 
TTA 


Thr 
ACC 


Gly 
GGT 


He 
ATC 


Trp 
TGG 


Val 
GTC 


He 

ATT 


Gly 
GGC 


His 
CAT 


106 
719 


Glu Cys 
GAA TGT 


Gly 
GGT 


His 
CAC 


His 
CAT 


Ala 
GCA 


Phe 
TTC 


Ser 
AGT 


Asp 
GAC 


Tyr 
TAT 


Gin 
CAA 


Trp 
TGG 


Val 
GTA 


Asp 
GAT 


ASp 
GAC 


Thr 
ACT 


122 
767 


Val 
GTT 


Gly 
GGT 


Phe 
TTT 


lie 
ATC 


Phe 
TTC 


His 
CAT 


Ser 
TCC 


Phe 
TTC 


Leu 
CTT 


Leu 
CTC 


Val 
GTC 


Pro 
CCT 


Tyr 
TAC 


Phe 
TTC 


Ser 
TCC 


T rD 
TGG 


138 
815 


Lys 

AAA 


Tyr 
TAC 


Ser 
AGT 


His 
CAT 


Arg 
CGT 


Arg 
CGT 


His 
CAC 


His 
CAT 


Ser 
TCC 


Asn 
AAC 


Asn 
AAT 


Gly 
GGA 


Ser 
TCT 


Leu 
CTC 


Glu 
GAG 


Lys 

AAA 


154 
863 


Asp 
GAT 


Glu 
GAA 


Val 
GTC 


Phe 
TTT 


Val 
GTC 


Pro 
CCA 


Pro 

CCG 


Lys 

AAG 


Lys 

AAA 


Ala 
GCT 


Ala 
GCA 


Val 
GTC 


Lys 

AAA 


Trp 
TGG 


Tyr 
TAT 


Val 
GTT 


170 
911 


Lys 
AAA 


Tyr 
TAC 


Leu 
CTC 


Asn 
AAC 


Asn 

AAC 


Pro 
CCT 


Leu 
CTT 


Gly 
GGA 


Arg 

CGC 


He 
ATT 


Leu 
CTG 


Ve 1 
GTG 


Leu 
TTA 


Thr 
ACA 


Val 
GTT 


Gin 
CAG 


186 
959 
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Phe lie Leu Gly Trp Pro 
TTT ATC CTC GGG TGG CCT 

Pro Tyr Asp Gly Phe Ala 
CCT TAT GAT GGT TTC GCT 

Lys Asp Arg Glu Arg Leu 
AAA GAC CGA GAA CGC CTC 

Ala Val Cys Tyr Gly Leu 
GCT GTC TGT TAT GGT CTT 

Ala Met lie Cys Val Tyr 
GCT ATG ATC TGC GTC TAT 

Leu Val Leu Val Thr Phe 
CTT GTC TTG GTA ACT TTC 

Tyr Asp Ser Thr Glu Trp 
TAT GAT TCA ACC GAG TGG 

Asp Arg Asp Tyr Gly He 
GAC AGA GAC TAT GGA ATA 

Thr His Val Ala His His 
ACA CAT GTG GCT CAT CAT 

Met Glu Ala Thr Glu Ala 
ATG GAA GCT ACA GAG GCG 

Phe Asp Gly Thr Pro Trp 
TTC GAT GGA ACA CCG TGG 

Cys Leu Tyr Val Glu Pro 
TGT CTC TAT GTA GAA CCG 

Tyr Tyr Asn Asn Lys Leu 
TAT TAC AAC AAT AAG TTA 

ATC AAT CTT CAT TTC CAT 

TTT GTT TCA ATA ATC TCA 

TGC CTA GTT ATG TGG TGT 

GTG CTG CCC AGT GAA GAA 

AAT TGA CCA CAA NAT ATC 

AAA CCG GAT ATC CAA ATT 



Leu Tyr Leu Ala Phe Asn 
TTG TAT CTA GCC TTT AAT 

Ser His Phe Phe Pro His 
TCA CAT TTC TTC CCT CAT 

G1 n He Tyr lie Ser Asp 
CAG ATA TAC ATC TCA GAT 

Tyr Arg Tyr Ala Ala Ser 
TAC CGT TAC GCT GCT TCA 

Gly Val Pro Leu Leu lie 
GGA GTA CCG CTT TTG ATA 

Leu Gin His Thr His Pro 
TTG CAG CAC ACT CAT CCT 

Glu Trp I le Arg Gly Ala 
GAA TGG ATT AGA GGA GCT 

Leu Asn Lys Val Phe His 
TTG AAC AAG GTG TTC CAT 

Leu Phe Ala Thr lie Pro 
CTC TTT GCA ACT ATA CCG 

He Lys Pro He Leu Gly 
ATA AAG CCA ATA CTT GGT 

Tyr Val Ala Met Tyr Arg 
TAT GTG GCC ATG TAT AGG 

Asp Thr Glu Arg G1y Lys 
GAT ACG GAA CGT GGG AAG 



TGA GGC TGA TAG GGC GAG 
GTT TTA GGT GTC TTG TTT 
GAG TCC ATN TAG TTG TGT 
CGG AAG TTA GTG TTC AAA 
CAA GTT TAC GTG TTT AAA 
CAA AAC CGG CTA TCC GAA 
TCC AGA GTA CTT AG 



Val Ser Gly Arg 202 

GTA TCA GGT AGA 1007 

Ala Pro He Phe 218 

GCA CCT ATC TTT 1055 

Ala Gly He Leu 234 

GCT GGT ATT CTA 1103 

Gin Gly Leu Thr 250 

CAA GGA TTG ACT nsi 

Val Asn Phe Phe 266 

GTG AAC TTT TTC 1199 

Ser Leu Pro His 282 

TCG TTA CCT CAT 1247 

Leu Val Thr Val 298 

TTG GTT ACG GTA 1295 

Asn He Thr Asp 314 

AAC ATA ACA GAC 1343 

His Tyr Asn Ala 330 

CAT TAT AAC GCA 1391 

Asp Tyr Tyr His 346 

GAT TAC TAC CAC 1439 

Glu Ala Lys Glu 362 

GAA .GCA AAG GAG 1487 

Lys Gly Val Tyr 378 

AAA GGT GTC TAC 1535 

384 

AGA AGT GCA ATT 1583 

AAG AAG CTA TGC 1631 

TCT GGT GCA TTT 1679 

CTG CTT CCT GCT 1727 

ATA CTC GGA ACG 1775 

TTC CAT ATC CGA 1823 

1855 
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LFFAH12 
FAH12 
ATFAD2 
BNFAD2 
GMFAD2- 1 
6MFAD2- 2 
ZMFA02 
RCFAD2 



LFFAH12 51 

FAH12 51 

ATFAD2 51 

BNFA02 51 

GMFAD2 - 1 51 

GMFAD2-2 51 

ZMFA02 51 

RCFAD2 51 

LFFAH12 101 
FAH12 101 
ATFA02 101 
BNFAD2 101 
GMFAD2-1 101 
GMFAD2-2 101 
2MFAD2 101 
RCFAD2 101 



LFFAH12 

FAH12 

ATFAD2 

BNFA02 

GMFA02-1 

GMFAD2-2 

2MFA02 

RCFAD2 



151 
151 
151 
151 
151 
151 
151 
151 



LFFAH12 201 
FAH12 201 
ATFAD2 201 
BNFAD2 201 
GMFAD2-1 201 
GMFAD2-2 201 
ZMFAD2 201 
RCFAD2 201 



10 

MGAGGRIM- - 
MGGGGRMST V 
MGAGGRMP - - 
MGAGGRMQ - - 
MGLA - KETTM 

MGAGGR 

MGAGGRMTEK 



20 

- -VTPSSKKS 
I TSNNSEKKG 
- -VPTSSKKS 
- -VSPPSKKS 
GGRGRVAKVE 
TDVPPANRKS 
EREKQEQLAR 



30 

- - ETEALKRG 

- - GSSHLKRA 

- - ETDTTKRV 

- - ETON I KRV 
VQGKKPLSRV 

- - EVDPLKRV 
ATGGAAMQRS 



40 

PCEKPPFTVK 
PHTKPPFTLG 
PCEKPPFSVG 
PCETPPFTVG 
PNTKPPFTVG 
PFEKPQFSLS 
PVEKPPFTLG 



DLKKA 
OLKRA 
DLKKA 
ELKKA 
QLKKA 
OIKKA 
QIKKA 



50 
IPQHC 
IPPHC 
IPPHC 
IPPHC 
IPPHC 
IPPHC 
IPPHC 



60 

FKRSIPRSFS 
FERSFVRSFS 
FKRSIPRSFS 
FKRSIPRSFS 
FQRSLLTSFS 
FQRSV LRSFS 
FERSVLKSFS 



70 

YLLTOITLVS 
YVAYOVCLSF 
YLISOIIIAS 
HLIWDIIIAS 
YVVYDLSFAF 
YVVYOLTIAF 
YVVHOLVIAA 



80 

CFYYVATNYF 
LFYSIATNFF 
CFYYVATNYF 
CFYYVATTYF 
I FY- I ATTY F 
CLYYVATHY F 
ALLYFALAI I 



90 

SLLPQPLSTY 
PYISSPLS-Y 
SUPOPLS-Y 
PLLPNPLS-Y 
HLLPQPFS-L 
HLLPGPLS-F 
PALPSPLR-Y 



100 

LAMPLYWVCQ 
VAWLVYWLFQ 
LAWPLYWACQ 
FAWPLYWACO 
IAWPIYWVLO 
RGMA I YWAVQ 
AAWPLYW I AQ 



110 

GCVLTGIWVI 
GCILTGLWVI 
GCVLTGIWVI 
GCVLTGVWVI 
GCLLTGVWVI 
GCILTGVWVI 

G 

WVM 

160 

HHSNNGSLEK 
HHSNIGSLER 
HHSNTGSLER 
HHSNTGSLER 
HHSNTGSLDR 
HHSNTGSLER 
HHSNTGSLER 
HHSNTGSLER 
210 

YLAFNVSGRP 
YLAFNVSGRP 
YLAFNVSGRP 
YLAFNVSGRP 
YLAFNVSGRP 
YLALNVSGRP 
YLATNASGRP 
YLAFNVSGRP 



120 

GHECGHHAFS 
GHECGHHAFS 
AHECGHHAFS 
AHECGHAAFS 
AHECGHHAFS 
AHECGHHAFS 

AFS 

AHOCGHHAFS 
170 

DEVFVPPKKA 
OEVFVPKSKS 
OEVFVPKQKS 
DEVFVPR-RS 
DEVFVPKPKS 
OEVFVPKQKS 
OEVFVPKKKE 
DEVFVPKKKS 
220 

YDG-FASHFF 
YDR-FACHYD 
YDG-FACHFF 
YDGGFACHFH 
YDS- FASHYH 
YOR-FACHYO 
YPR-FACHFO 
YDR-FACHYD 



130 

DYQWVDDTVG 
EYOLADDIVG 
DYOWLDDTVG 
DYQWLDDTVG 
KYQWVDDVVG 
DYQLLDDIVG 
OYSLLDDVVG 
DYQLLODVVG 
180 

AVKWYVKYL- 
KISWYSKYS- 
AIKWYGKYL- 
OTSSGTAST- 
KVAWFSKYL- 
CIKWYSKYL- 
ALPWYTPYVY 
SI RWYSKYL - 
230 

PHAPI FKDRE 
PYGPI FSERE 
PNAPI YNDRE 
PNAPI YNDRE 
PYAPI YSNRE 
PYGPI YSDRE 
PYGPI YNDRE 
PYGPI YNDRE 



140 

FI FHSFLLVP 
LIVHSALLVP 
LI FHSFLLVP 
LI FHSFLLVP 
LTLHSTLLVP 
LI LHSALLVP 
LVLHSSLMVP 
LILHSCLLVP 
190 

NNPLGRI LVL 
NNPPGRVLTL 
NNPLGR JMML 
STTFGRTVML 
NNPLGRAVSL 
NNPPGRVLTL 
NNPVGRVVHI 
NNPPGRIMTI 
240 

RLQI YISDAG 
RLOI YIADLG 
RLOI YLSDAG 
RLOIYISDAG 
RLLI YVSDVA 
RLOIYISDAG 
RAQI FVSDAG 
RIEIFISDAG 



150 

YFSWKYSHRR 
YFSWKYSHRR 
YFSWKYSHRR 
YFSWKYSHRR 
YFSWKISHRR 
YFSWKYSHRR 
YFSWKYSHRR 
YFSWKHSHRR 
200 

TVQFI LGWPL 
AATLLLGWPL 
TVOFVLGWPL 
TVOFTLGWPL 
LVTLTIGWPM 
AVTLTLGWPL 
VVOLTLGWPL 
AVTLSLGWPL 
250 

I LAVCYGLYR 
IFATTFVLYO 
I LAVC FGLYR 
I LAVCYGLLP 
LFSVTYSLYR 
VLAVVYGLFR 
VVAVAFGLYK 
VLAVTFGLYQ 



50 
50 
50 
50 
50 
50 
50 
50 

100 
100 
100 
100 
100 
100 
100 
100 

150 
150 
150 
150 
150 
150 
150 
150 

200 
200 
200 
200 
200 
200 
200 
200 

250 
250 
250 
250 
250 
250 
250 
250 
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260 

C \J \J 


LFFAH12 


251 


YAAS06I TAM 


FAH12 


251 


- T MA KG LAW V 


ATFAD2 


251 


Y AAAOGMASM 


BNFAD2 


251 


Y AAVOGVASM 


GMFAD2- 1 


251 


V ATLKGLVWL 


GHFAD2-2 


251 


LAMAKGLAWV 


ZMFA02 


251 


L A AAFGVWWV 

LnAATU VWWV 


RCFAD2 


251 


LA T AKGI AUV 








LFFAH12 


301 


P a t vt i/nun y 

tn L V 1 VUf\U I 


FAH12 


301 

Jul 


£ AMVTVnRHY 

V3/M 1 Y 1 V Uf\U T 


ATFAD2 


301 


Pa i ATvnpn y 


BNFAD2 


301 

Jul 


n a i at \i npn y 


GMFAD2-1 


301 


GA LATMDRnY 


GMFAD2-2 


301 


GALATVDRDY 


ZMFAD2 


301 


GALATMDRDY 


RCFA02 


301 


GALATVDRDY 






360 


LFFAH12 


351 


DYYHFDGTPW 


FAH12 


351 


EYYRYOGTPF 


ATFAD2 


351 


DYYQFOGTPW 


BNFAD2 


351 


EYYQF0GTPV 


6MFAD2- 1 


351 


EY YQFDDTPF 


6MFAD2- 2 


351 


EYYRFDETPF 


ZMFAD2 


351 


DYYHFDPTPV 



270 280 
1CVYGVPLLI VNFFLVLVTF 
MRI YGVPLLI VNCFLVMITY 
ICLYGVPLLI VNAFLVL I TY 
VCFLRVPLLI VNGFLVLITY 
LCVYGVPLLI VNGFLVT ITY 
VCVYGVPLLV VNGFLVLITF 
VRVYAVPLLI VNAWLVLITY 
VCVYGVPLLV VNSFLVLITF 



320 330 
GI LNKVFHNI TDTHVAHHLF 
GVLNKVFHNI ADTHVAHHLF 
61 LNKVFHNI TDTHVAHHLF 
GI LNQGFHNI TDTHEAHHLF 
GI LNKVFHHI TDTHVAHHLF 
GILNKVFHNI TDTHVAHHLF 
GI LNRVFHNI TDTHVAHHLF 
GILNKVFHNI TDTQVAHHLF 

370 380 
YVAMYREAKE CLYVEPDTER 
YKALWREAKE CLFVEPDEGA 
YVAMYREAKE C I YVEPDREG 
VKAMWREAKE CIYVEPORQG 
YKALWREARE CLYVEPDEGT 
VKAMWREARE CIYVEPDOST 
AKATWREAGE CIYVEPE--- 



290 300 
LQHTHPSLPH YDSTEWEWIR 
LQHTHPA I PR YGSSEWDWLR 
LOHTHPSLPH YDSSEWDWLR 
LQHTHPSLPH YDSSEWDWLR 
LOHTHFALPH YDSSEWDWLK 
LOHTHPALPH YTSSEWDWLR 
LOHTHPSLPH YDSSEWDWLR 
LOHTHPALPH YDSSEWDWLR 



340 350 
ATI PHYNAME ATEAIKPILG 
ATV PHYHAME ATKAI KPIMG 
STMPHYNAME ATKAI KPILG 
STNPHYHAME ATKAI KPILG 
STMPHYHAME ATNAI KPILG 
STMPHYHAME ATKAI KPILG 
STMPHYHAME ATKAI RPILG 



390 400 

GKKGVYYYNN K-L 

PTQGVFWYRN KY- 

DKKGVYWYNN K-L 

EKKGVFWYNN KL* 

SEKGVYWYRN KY- 

ESKGVFWYNN KL- 

DRKGV FWYNK KF* 
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Plasmid name: pSLJ44026 
Plasmid size: 25.70 kb 
Constructed by: Jonathon Jones 
Construction date: 1992 

Comments/References: Transgenic Research 1,285-297 (1992) 
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